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#T266  in  the  saturation  stage.  We  shall  (i)obtain  numerical  results  for  the  operating 
characteristics;  and  (ii)  compare  these  with  the  known  experimental  characteristics. 

B)  We  shall  extend  the  above  studies  to  the  saturation  stage  of  relativistic, 
electromagnetic,  cylindrical  crossed--field  devices,  particularly  the  A6  magnetron.  We 
shall  (i)obtain  results  for  the  operating  characteristics;  and  (ii)  seek  to  detail  and 
understand  how  each  of  the  relativistic,  and  electromagnetic,  and  cylindrical  effects 
affect  the  resulting  density  profiles  and  other  operating  characteristics  in  this  stage. 

C)  We  shall  continue  other  studies  that  have  potential  interest  to  the  AFOSR,  such  as  in 
the  areas  of  nonlinear  optics,  instabilities  in  higher  order  systems,  the  development  of  a 
quantitative  measurement  of  variational  results,  and  precursor  formation  in  the 
ionosphere. 
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Summary  of  Accomplishments: 

1 )  Saturated  stage  of  magnetrons  and  CFAs:  .The  main  thrust  of  this  contract  was 
to  theoretically  study  the  various  stages  of  high  power  magnetrons  (HPM)  and 
cross-field  amplifiers  (CFA)  by  means  of  the  cold-fluid  plasma  equations.  The 
work  accomplished  on  this  project  has  been  published  in  PubOl  and  Pub03  listed 
below,  which  are  a  conference  proceedings  followed  by  publication  in  a  standard 
journal.  Crossed-field  devices  have  basically  two  main  stages:  first  there  is  the 
initiation  stage  wherein  the  power  is  turned  on  and  the  energy  output  of  the 
device  ramps  up  from  zero  towards  its  maximum  output.  Second  is  the 
saturation  stage  wherein  the  device  has  achieved  the  maximum  power  output. 
The  growth  in  the  initiation  stage  is  due  to  the  excitation  of  linear  instabilities. 

The  instabilities  must  vanish  when  the  saturation  stage  is  achieved,  which  is  a 
stationary  state.  When  the  above  theoretical  results  were  used  in  the  design  of  a 
Fortran  code  to  calculate  this  saturated  stage,  it  was  found  that  regardless  of  the 
approach  used  (WKB  expansions,  direct  numerical  integrations,  etc.),  unwanted 


instabilities  invariably  occurred.  First,  a  review  of  the  theory  in  these 
publications  revealed  that  the  theory  was  defective,  in  that  it  attempted  to  solve 
the  problem  of  the  saturated  stage  in  an  unstable  manner.  The  approach  that 
was  used  was  exactly  the  same  approach  which  had  worked  successfully  for 
solving  the  initiation  stage:  separating  the  problem  into  an  rf  part  and  a  dc  part, 
then  numerically  iterating  back  and  forth  between  them  until  a  solution  was 
achieved.  But  what  happens  when  one  reaches  the  saturation  stage  is  that  the 
interaction  between  the  rf  wave  and  the  dc  background  becomes  so  strong  that 
one  cannot  treat  these  two  modes  separately.  Rather,  one  must  solve  the 
coupled  system  as  a  single  unit  because  small  changes  in  one  mode  can 
generate  larger  changes  in  the  other.  The  difficulty  here  is  that  the  coupled 
system  then  becomes  equivalent  to  a  15th  order  set  of  ordinary  differential 
equations  (ODEs)  which  are  singular  ODEs.  They  are  singular  because  some  of 
the  highest  derivatives  in  these  ODEs  are  multiplied  by  small  quantities.  Whence 
any  error  in  the  evaluation  of  these  derivatives  can  become  magnified  as  one 
integrates  forward.  This  set  also  has  various  boundary  conditions  to  be  set  at  the 
cathode  and  also  at  the  anode.  Whence  this  problem  then  becomes  a  two-point 
boundary  problem.  In  short,  this  is  the  worst  of  all  possible  cases  to  attempt  to 
solve.  And  all  attempts  by  this  PI  to  numerically  integrate  this  15th  order  system 
have  not  been  successful.  Let  us  note  that  in  the  initiation  stage,  one  has  only  a 
fourth-order  system  of  ODEs  for  the  rf  mode,  another  second-order  system  of 
ODEs  for  the  dc  mode  and  none  of  these  ODEs  are  singular.  This  reduction  this 
reduction  in  the  order  occurs  because  one  has  assumed  that  certain  algebraic 
relations  are  satisfied  between  the  physical  quantities.  These  initiation  ODEs 
can  be  successfully  numerically  integrated  mainly  due  to  being  only  6th  order. 
However  the  jump  from  6th  order  to  15th  order  requires  a  much  more  careful 
formulation  of  the  numerical  approach  in  order  to  succeed.  We  note  that  at  this 
point,  we  have  not  been  able  to  analyze  whether  the  instabilities  involved  are 
numerical  instabilities  or  actual  physical  instabilities  coming  from  the  cold-fluid 
equations.  If  one  is  not  to  be  able  to  numerically  integrate  these  ODEs,  then  it  is 
necessary  to  search  for  another  approach.  There  are  variational  methods  that 
could  possibly  work.  There  are  methods  that  use  single  particle  orbits.  The 
same  is  done  in  PIC  codes,  but  there  one  only  solves  for  the  particle  orbits  and 
averages  appropriately  to  get  the  macroscopic  quantities  such  as  density  and 
average  velocities.  Our  approach  has  been  to  solve  for  that  stationary  state 
wherein  the  device  is  delivering  maximum  power,  if  such  a  state  exists.  Near  the 
end  of  the  contract,  it  was  realized  that  there  is  another  approach  which  would  be 
somewhere  between  the  sole  use  of  PIC  codes  and  fully  solving  the  appropriate 
ODEs.  This  would  be  a  hybrid  method,  whereby  one  would  calculate  the  single 
particle  orbits  in  an  assumed  stationary  dc  electric  field  and  an  assumed 
stationary  rf  electromagnetic  field.  From  a  knowledge  of  the  single  particle  orbits 
in  these  fields,  one  can  then  obtain  the  macroscopic  density  and  the  average 
particle  velocity,  if  such  a  stationary  state  could  exist.  If  it  could  exist,  then  linear 
stability  theory  could  be  used  to  study  its  stability  features. 

2)  The  Camassa-Holm  Equation:  Certain  problems  can  have  instabilities  which  are 
related  to  the  presence  of  eigensolutions  of  the  perturbed  equations  whose 
analytical  properties  are  not  spatially  uniform.  This  shows  up  in  integrable 
systems  in  that  the  eigenvalue  problem  of  the  Lax  pair  will  have  this  nature.  One 
example  of  this  is  the  Camassa-Holm  equation  (CH)  when  the  initial  data  is 
sufficiently  strong.  Other  examples  are  certain  degenerate  parametric  optical 


laser  problems  such  as  degenerate  two-photon  propagation.  A  very  good  reason 
to  study  the  CH  equation  first,  instead  of  the  optical  problems,  is  that  the  CH 
equation  has  the  advantage  it  lacks  many  of  the  complexities  found  in  these 
optical  problems.  Thus  its  results  can  therefore  be  potentially  used  as  a  model 
for  understanding  how  to  treat  these  optical  problems.  In  Pub02,  we  carry  out  an 
analysis  of  this  feature  when  the  initial  data  for  the  CH  equation  is  sufficiently 
strong  and  is  on  an  infinite  spatial  interval.  We  show  that  one  can  solve  this 
problem  if  the  infinite  interval  is  broken  into  semi-infinite  and  finite  regions,  inside 
of  which,  each  region  contains  its  own  collection  of  scattering  data.  Given  the 
scattering  data  in  these  regions,  we  then  demonstrate  that  one  can,  in  principle, 
reconstruct  the  full  solution  on  the  infinite  interval. 

3)  In  Pub04,  we  have  shown  how  one  could  solve  the  problem  of  the  estimation  of 
the  accuracy  of  the  solution  of  a  variational  problem.  Variational  methods  have 
often  been  successful  in  solving  complicated  problems.  And  the  method  is 
usually  more  accurate  that  one  can  justify.  We  note  that  a  variational  formalism 
does  exist  for  the  HPM  problem  [D.  J.  Kaup  and  Gary  E.  Thomas,  J.  Plasma 
Physics  57,  765-84  (1997)].  Consequently  even  if  all  else  would  fail,  one  would 
still  have  the  option  to  study  the  HPM  problem  variationally.  If  so,  then  one 
would  like  to  have  some  means  for  estimating  the  accuracy  of  such  a  solution. 
The  results  of  this  publication  shows  how  one  can  always  obtain  estimates  of  the 
errors  in  a  variational  solution  and  their  associated  quantities,  and  without  having 
to  know  anything  about  the  actual  solution. 

4)  Pub05  and  Pub07are  the  result  of  joint  work  between  Prof.  Gerdjikov  and  myself 
on  eigenvalue  problems.  It  is  cast  into  the  form  of  studies  of  integrable 
equations,  but  the  problem  is  really  one  of  a  study  of  direct  scattering  and  inverse 
scattering  in  one  dimension,  which  are  also  exactly  of  the  same  nature  as  the 
problem  of  solving  the  perturbed  cold-fluid  equations  in  plasma  physics.  Such 
studies  give  the  investigators  experience  in  understanding  these  problems  and 
experience  in  devising  methods  of  solution  for  such  problems. 

5)  In  integrable  systems,  one  has  two  classes  of  solutions:  solitons  (bound  states) 
and  radiation  (the  continuous  spectrum).  Given  the  scattering  data,  in  order  to 
reconstruct  the  potentials,  one  must  solve  a  set  of  linear  integral  equations. 

When  the  continuous  spectrum  is  absent,  then  these  linear  integral  equations 
can  be  reduced  to  a  set  of  only  algebraic  equations.  For  certain  integrable 
systems,  and  on  finite  or  semi-infinite  intervals,  one  can  invariably  convert  any 
continuous  spectrum  into  bound  states,  which  do  generate  soliton-like  solutions 
which  are  called  “virtual  solitons".  Integrable  systems  of  this  nature  are  mainly 
optical  parametric  equations  which  are  posed  on  finite  intervals,  such  as  second 
harmonic  generation  and  related  hyperbolic  equations.  In  Pub06,  Prof.  Steudel 
and  I  have  demonstrated  how  to  do  this  and  how  it  is  related  to  earlier  work  in 
NMR  (nuclear  magnetic  resonance).  What  we  demonstrate  here  is  that  the 
scattering  data  for  such  problems  can  be  reduced  to  a  countable  set  of  data.  In 
the  language  of  integrable  equations,  we  show  that  for  any  reasonable  initial 
value  problem  of  such  a  system,  the  continuous  spectrum  can  be  converted  into 
a  countable  set  of  bound  states.  Thus  the  inverse  scattering  problem  can  then 
be  reduced  to  an  algebraic  problem,  which  is  the  same  problem  as  constructing 
an  N-soliton  solution,  where  N  is  countable.  In  practical  terms,  it  is  further 
demonstrated  that  typically  one  can  take  N  to  be  finite,  albeit  large. 

6)  Pub08  is  joint  work  with  J.  Yang  and  is  a  solution  of  the  perturbation  problem  of 


the  Sasa-Satsuma  equation,  which  is  integrable  and  has  a  3x3  eigenvalue 
problem.  This  appears  to  be  the  first  time  that  anyone  has  explicitly  calculated 
the  squared  eigenfunctions  for  anything  beyond  the  AKNS  problem.  This  result 
has  a  breath  of  complexity  which  goes  well  above  that  of  the  AKNS  and  the 
Schrodinger  eigenvalue  problems.  It  also  contains  certain  universal  features 
which  one  can  see  will  continue  into  higher  order  problems.  It  has  just  this  month 
appeared  in  the  Journal  of  Mathematical  Physics. 

7)  Pub09  is  another  publication  in  collaboration  with  Prof.  Gerdjikov.  In  this  case 
we  have  presented  a  one-dimensional,  integrable  system  which  closely  models 
other  proposed  models  for  a  one-dimensional  Bose-Einstein  condensate.  Its 
basic  structure  is  a  multi-component,  nonlinear  Schrodinger  equation,  and  has 
some  freedom  in  assigning  values  to  the  coefficients  of  the  nonlinear  terms.  This 
is  recent  work  and  was  first  presented  in  the  Summer  of  2008. 

8)  PublO  is  work  in  progress  and  has  been  supported  by  this  contract.  At  the 
present  time,  no  variational  principal  has  been  developed  for  two-level  atoms  and 
the  phenomenon  of  self-induced  transparency  (SIT).  Although  two-level  SIT  is 
an  integrable  system,  three-level  SIT  is  not  integrable  except  for  special  level 
configurations  and  then  not  unless  the  coupling  coefficients  are  certain  special 
values.  The  advantage  of  having  a  variational  principle  for  three-level  atoms  is 
obvious.  We  could  execute  certain  studies  of  the  three-level  non-integrable 
cases. 

9)  Publl  is  also  work  in  progress  by  an  undergraduate  student.  This  work  will 
complement  Pub08  and  will  allow  one  to  study  perturbations  of  the  integrable 
three-wave  resonant  interactions. 

10)  Pub12  is  work  in  progress  and  is  concerned  with  ionospheric  interactions  when 
the  heater  wave  is  tuned  to  the  second  harmonic  of  the  electron  cyclotron 
frequency.  Experiments  at  this  frequency  have  recently  been  conducted  at  the 
HAARP  transmitter.  In  our  earlier  ionospheric  work  of  1992,  this  frequency  range 
was  inadequately  treated  since  at  that  time,  there  was  no  experimental  data  for 
that  regime.  This  work  is  intended  to  fill  that  gap  and  to  compare  theory  with  the 
experiments  for  that  frequency. 
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Abstract 

We  study  the  AKNS  (Ablowitz-Kaup-Newell-Segur)  inverse  scattering 
problem  for  rational  reflection  coefficients  on  a  semi-infinite  interval.  We 
demonstrate  that  the  Marchenko  integral  equations  for  the  AKNS  version  on 
such  an  interval  can  be  solved  in  a  direct  and  straightforward  way  by  algebraic 
methods  for  any  set  of  rational  reflection  coefficients,  vanishing  at  infinity  The 
general  AKNS  scattering  problem  for  this  case,  as  well  as  the  usual  symmetry 
reductions,  are  discussed.  The  connection  to  an  alternative  procedure  by  Rourke 
and  Morris  (1992  Phys .  Rev.  A  46  3631)  is  pointed  out.  Our  procedure  is 
built  around  a  constant  matrix.  A/,  which  can  be  constructed  from  the  poles 
and  residues  of  the  rational  reflection  coefficients.  Under  certain  conditions, 
which  we  define  as  minimal  symmetry  and  which  then  suitably  constrains  the 
distribution  of  eigenvalues,  we  show  that  it  is  always  possible  to  represent 
the  resulting  potentials  as  truncated  iV-soliton  potentials.  This  procedure  is  of 
interest  for  solving  initial-boundary  value  problems  of  integrable  hyperbolic 
systems  by  the  inverse  scattering  transform  (1ST)  applied  to  a  semi-infinite  or 
finite  interval. 


1.  Introduction 

Upon  the  development  of  the  inverse  scattering  transform  for  solving  initial  value  problems 
for  integrable  nonlinear  evolution  equations,  it  was  natural  that  efforts  would  soon  be  made 
toward  applying  this  method  to  initial-boundary  value  problems  as  well.  This  was  first  done 
by  Ablowitz  and  Segur  [I  ]  who  developed  an  inverse  scattering  transform  for  a  special  initial¬ 
boundary  value  problem  on  the  semi-infinite  interval  for  the  nonlinear  Schrodinger  equation. 
The  more  general  initial -boundary  problem  for  the  nonlinear  Schrodinger  equation  was  soon 
realized  to  be  more  complex,  in  that  Riemann-Hilbert  techniques  were  found  to  be  necessary 
for  their  solution  [2-8]. 
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On  the  other  hand,  for  hyperbolic  integrable  systems  where  one  may  use  characteristic 
coordinates,  the  above  complexities  are  generally  absent.  In  earlier  papers  [9-12],  an  inverse 
scattering  transform  for  solving  hyperbolic  initial -boundary  problems  on  a  finite  interval  was 
developed.  The  procedure  consists  of  three  steps: 

(1)  Solve  the  direct  scattering  problem  for  the  initial  values  at  t  =  0  on  the  interval 

0  <  jc  <  Xf. 

(2)  Determine  the  effective  S-matrix  at  /  =  //  >  0  by  the  use  of  boundary  data  at  the  left  end, 

jc  =  0,  only. 

(3)  Reconstruct  the  potentials  by  solving  the  inverse  scattering  problem  at  t  —  tf. 

In  solving  these  problems  [9-1 1]  it  was  observed  that  the  effective  reflection  coefficients 
could  always  be  replaced  by  an  equivalent  set  of  poles  and  residues.  Typically  however, 
there  are  usually  an  infinite  number  of  these  poles  and  residues  for  which  there  is  at  least 
one  accumulation  point.  However  by  selecting  a  finite  number  N  of  these  poles  from  the 
infinite  set  and,  with  increasing  /V,  one  could  obtain  increasing  accurate  approximations  of 
the  solution.  Whence  the  reconstruction  problem  for  these  systems  could  then  be  reduced  to 
the  reconstruction  problem  for  rational  reflection  coefficients  which  then  becomes  an  algebraic 
problem.  In  this  manner  one  could  bypass  the  need  to  solve  the  Marchenko  integral  equations 
[13],  replacing  them  instead  with  a  set  of  equivalent  algebraic  equations.  For  second  harmonic 
generation  and  for  degenerate  two-photon  propagation  in  the  low-excitation  limit,  it  has  been 
demonstrated  that  the  procedure  developed  by  Steudel  and  Kaup  (S&K)  did  indeed  work 
[10,11]. 

Related  to  this  but  solved  by  a  different  approach  is  the  earlier  work  of  Rourke  and  Morris 
(R&M)  [14]  whereby  they  solved  a  reconstruction  problem  which  arose  in  nuclear  magnetic 
resonance.  Here  they  were  given  the  reflection  coefficients  for  a  half-pulse  which  vanished 
for  r  >0  (with  r  corresponding  to  our  variable  jc).  From  this  information  they  needed  to 
reconstruct  the  potentials.  To  solve  this  problem,  they  cleverly  added  another  half-pulse ,  but 
vanishing  for  r  <  0,  which  was  crafted  such  that  the  composite  pulse  would  be  reflectionless. 
This  new  composite  pulse,  now  a  pure  multi-soliton  solution  and  extending  over  the  full  r- 
axis,  could  then  be  reconstructed  by  the  algebraic  soliton-lattice  method  due  to  Calogero  and 
Degasperis  [15].  Note  that  the  physical  solution  in  this  case  was  only  for  r  <  0.  Thus  by 
arbitrarily  adjusting  the  shape  for  r  >  0,  in  the  unphysical  regime,  they  were  able  to  convert 
the  desired  solution  into  the  left  halt  of  a  pure  /V- soliton  pulse  and  therefore  could  construct  the 
solution  entirely  by  algebraic  means,  thereby  also  bypassing  the  need  to  solve  the  Marchenko 
integral  equations.  Thus  again,  the  reflection  coefficient  on  a  semi-infinite  interval  became 
equivalent  to  rational  reflection  coefficients. 

The  most  difficult  aspect  of  the  reconstruction  problem  on  the  infinite  interval  is  always 
the  presence  of  a  continuous  spectrum,  in  which  case  one  cannot  in  general  reconstruct  the 
potentials  without  solving  the  Marchenko  integral  equations  [13].  However  in  the  above 
two  cases,  we  see  that  when  the  interval  for  the  potential  becomes  semi-infinite  or  finite, 
interestingly  the  basic  nature  of  the  reconstruction  problem  changes  dramatically,  reducing  to 
only  an  algebraic  problem  instead  of  the  problem  of  solving  a  set  of  integral  equations. 

As  we  shall  see,  on  a  semi-infinite  interval  or  a  finite  interval,  the  usual  relationship 
between  the  structure  of  the  potentials  and  the  scattering  data  will  become  different  from  that 
which  is  found  on  the  infinite  interval.  In  the  latter  case  [16],  each  soliton  corresponds  to 
a  pair  of  zeros  of  a  and  a,  which  for  potentials  on  compact  support,  also  are  the  poles  of 
the  reflection  coefficients.  On  the  other  hand,  the  radiation,  which  is  the  linear-like  part  of 
the  solution,  has  a  Fourier  transform-like  correspondence  with  the  reflection  coefficients  for 
real  values  of  the  spectral  parameter.  However  on  a  semi-infinite  or  a  finite  interval,  these 
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relations  no  longer  need  hold.  First,  as  in  the  above  two  examples,  the  continuous  spectra 
can  be  replaced  by  a  countable  infinity  of  regular  solitons  and/or  virtual  solitons  [11]  which 
are  represented  by  poles  of  the  reflection  coefficients.  Second,  the  location  of  these  poles  in 
the  complex  spectral  plane  is  generally  not  located  where  the  soliton’s  eigenvalue  is  located 
in  the  infinite  interval  case.  These  are  just  two  of  the  unique  differences  that  distinguish  the 
inverse  scattering  transform  (1ST)  on  the  finite/semi-mfinite  interval  from  the  well-known 
infinite  interval  case.  These  differences  will  be  pointed  out  as  we  develop  this  work. 

Since  we  have  two  distinctly  different  approaches  being  used  to  solve  the  same  basic 
problem,  it  then  becomes  appropriate  to  compare  and  discuss  the  similarities,  differences  and 
limitations  in  each  of  these  two  approaches.  In  the  process,  we  shall  also  show  how  one  can 
reformulate  this  reconstruction  problem  into  a  set  of  ordinary  differential  equations  (ODEs) 
with  constant  coefficients,  from  which  all  the  above  features  can  be  obtained. 

Section  2  consists  of  four  parts.  In  subsection  2.1,  the  method  of  solution  for  the  general 
AKNS  (Ablowitz-Kaup-Newell-Segur)  problem  where  r  is  unrelated  to  q  is  developed  for 
the  case  when  the  reflection  coefficients  are  rational.  We  show  that  our  previous  results 
[9-12]  on  the  Marchenko  equations  can  be  replaced  by  a  system  of  ordinary  differential 
equations  with  constant  coefficients,  (23),  whose  coefficient  matrix  M  is  built  from  the  poles 
and  the  residues  of  the  given  reflection  coefficients.  We  show  that  with  the  definition  of 
two  sets  of  ( N  +  /V)-vec tors,  the  two  resulting  ODE  systems  become  identical,  but  differ  in 
their  boundary  values.  With  the  use  of  the  matrix  exponential  function  an  exact  solution  is 
obtained  for  these  systems  and  for  the  reconstruction  problem.  A  condensed  summary  of 
this  new  method  of  solution  is  then  given  in  subsection  2  2.  In  subsection  2.3  we  derive  a 
system  of  partial  differential  equations  (PDEs)  fulfilled  by  the  auxiliary  functions  introduced 
in  subsection  2.1.  In  subsection  2  4,  we  give  a  reformulation  of  the  characteristic  equation 
of  the  matrix  M.  The  resulting  equation  (43)  gives  us  a  bridge  to  connect  our  procedure  to 
that  which  was  developed  by  R&M  [14].  Section  3  deals  with  symmetry  reductions.  In 
sections  4  and  5,  the  general  AKNS  system  is  restricted  to  what  we  call  minimal  symmetry.  In 
section  4  we  show  how  to  compute  all  scattering  data  from  the  rational  reflection  coefficients. 
In  section  5  we  show — quite  parallel  to  [14] — how  a  potential  with  rational  reflection 
coefficients  can  be  supplemented  to  become  a  complete  A-soliton  potential.  (By  ‘complete’, 
we  mean  that  on  the  mterval(s)  on  which  the  potential(s)  vanish,  supplemental  potentials  have 
been  added  such  that  the  total  potential  on  the  full  infinite  interval  has  become  a  pure  /V-soliton 
solution.)  Several  examples  are  given  in  section  ??.  In  particular  we  demonstrate  there  by  an 
explicit  example  that  when  r  is  unrelated  to  q  and  where  minimal  symmetry  is  not  present,  the 
R&M  method  [14]  does  not  work.  Summary  and  discussions  are  presented  in  section  6. 

2.  The  general  AKNS  system 

2. 1.  The  method  of  solution 

From  this  point  on,  we  shall  only  consider  the  AKNS  scattering  problem  [16,  26], 


under  the  conditions  that  r(jc)  =  0  =  q(x)  when  x  <  0.  We  will  solve  the  reconstruction 
problem  under  the  assumptions  that  the  reflection  coefficients  o(i;)  s  b(t;)/a(t;),< r(£)  = 
b(^)/a(^)  are  rational  functions.  Here  the  scattering  coefficients,  a,b,a>b ,  are  defined 
according  to  [16].  (Note  that  as  per  the  AKNS  convention,  the  overbar  does  not  indicate  a 
complex  conjugation,  but  rather  indicates  an  independent  quantity.  Complex  conjugation  will 
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be  indicated  instead  by  an  asterisk.)  We  will  write  C+(C_)  for  the  open  upper  (lower)  half 
£-plane.  Note  that  when  we  take  r(jc)  =  0  =  q(x)  for  x  <  0,  it  then  follows  from  appendix  5 
of  [16]  that  /?(£)  becomes  analytic  in  C.  while  b(t;)  becomes  analytic  in  C+.  In  general, 
we  shall  treat  this  problem  with  r  and  q  both  independent  and  unrelated,  although  we  shall 
also  discuss  what  restrictions  and  symmetries  apply  when  these  potentials  are  constrained  in 
the  usual  manners  such  as  r(x)  =  ±q( jc)  and  r(x)  =  ±q*(x).  The  Schrodinger  equation 
variation  of  this  problem  has  been  treated  in  [17-19]. 

For  solving  the  inverse  scattering  problem,  the  analytical  properties  of  the  scattering 
coefficients  a,b,ayh  are  decisive.  We  recall  that  for  any  potentials  q(x),r(x)  vanishing 
stronger  than  any  power  for  |jc|  — ►  oo,  is  an  analytic  function  in  C+  while  a(t;)  is  an 
analytic  function  in  C_.  It  has  already  been  pointed  out  above  that  under  the  more  stringent 
condition  q  =  r  =  0  for  x  <  0 ,  £«)  is  analytic  in  C+  while  b(£)  is  analytic  in  C_. 
Consequently  any  singularity  of  a  (a)  in  C+(C_)  must  be  a  zero  of  a(a). 

In  relation  to  the  above,  there  are  several  series  of  articles  dealing  with  various  features 
of  the  inverse  scattering  problem  for  the  Dirac  problem  (which  is  another  equivalent  form 
of  (I  ))  on  the  semi-line  (see  [20-25]  and  references  therein).  These  papers  use  the  spectral 
function  and  the  Weyl-Titch marsh  function,  where  the  latter  corresponds  to  the  reflection 
coefficient(s)  of  (1 ).  This  function  is  more  general  however  because  of  the  weaker  conditions 
used  concerning  the  asymptotics  of  the  potentials.  Also  2/i-dimensional  generalizations  of  the 
2x2  scattering  problem  are  treated.  However  these  articles  are  generally  restricted,  in  AKNS 
notation,  to  the  r  =  ±q*  case. 

There  are  two  ways  to  solve  the  Marchenko  equations:  either  invert  about  x  =  —  oo  or 
invert  about  x  =  +oo.  On  a  semi-infinite  interval  where  the  potentials  vanish  for  x  <  0, 
these  two  ways  are  no  longer  equivalent.  One  finds  that  there  are  both  subtle  and  distinct 
differences  between  them.  Here  we  shall  treat  the  case  when  we  invert  about  x  =  — oo.  At  the 
end  of  this  section  we  shall  make  some  comments  about  how  inversion  about  x  =  oo  would 
differ  from  that  given  here.  The  Marchenko  equations  for  inversion  about  x  =  +oo  were  first 
given  in  [26].  Inversion  about  x  =  — oo  can  be  equivalently  done  and  was  given  in  [16], 
equations  (4.39a),  (4.39b),  which  is 


L(x,y)+\ 

0 

1  G(x  +  y)  - 

f  L(x,  s)G(s  +  y)ds  =  0, 

/ -00 

x  >  y 

(2) 

L(x,y)  + 1 

(?) 

|  G(x  +  y)  +  j 

f  L(x,  s)G(s  -i-  y)  ds  =  0, 

-00 

x  >  y. 

(3) 

where  under  the  conditions  considered  above, 
G(z)  =  2^-jf  er(f  )c",?zdf 

G(z)  =  2-  f <r({)e,?zd<. 

2jt  Jc 


(4) 


In  (4),  the  contour  C  of  integration  goes  from  £  =  — oo  to  +oc  and  either  on  or  above  the  real 
axis  and  also  above  all  zeros  of  a (£)  in  C+.  Meanwhile  C  goes  from  £  =  — oo  to  +00  and 
either  on  or  below  the  real  axis  and  also  below  all  zeros  of  a(£)  in  C_ .  Note  that  as  mentioned 
before,  there  will  be  no  poles  of  b(£)  in  <C+  and  no  poles  of  b(£)  in  C_.  Therefore  C{C)  as 
well  may  be  defined  as  going  above  (below)  all  poles  of  cr((r). 

This  is  a  system  of  linear  integral  equations  for  the  two-componcnt  vector  functions 
L  =  (L 1,  L2)t,  L  =  (Lj,  L2)7 .  On  an  infinite  interval,  the  kernels  G,  G  can  generally  be 
given  as  sums  over  the  bound  state  eigenvalues  plus  Fourier  integrals  over  the  real  £-axis.  On 
the  semi-infinite  interval  considered  here,  they  may  be  reduced  to  a  similar  form. 
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From  the  Marchenko  equations,  the  required  potentials  can  be  reconstructed  from 
equations  (4.42^),  (4.42 d)  in  [16], 

q(x)  =  2L|(jr,  *),  r(x)  = -2L2(x,x).  (5) 

Now  we  assume  that  the  reflection  coefficients  cr(£)  and  <?(£)  are  rational  functions  and  that 
they  vanish  at  least  as  fast  as  0(1  /£)  for  |C|  — ►  oo.  Forz  <  0,  it  follows  that  the  contour C{C) 
may  be  closed  in  C+(C_),  giving  G(z)  =  G(z)  =  0.  For  z  >  0  then  the  contour  C(C)  may  be 
closed  in  C_(C+)  so  that  G  and  G  may  be  written  as  sums  over  the  poles.  Let  us  assume  that 

there  are  /V  distinct  poles  £y,  j  =  1 . N,  of  a  and  N  distinct  poles  £/,  /  =  1 . N ,  of  6 

with  the  respective  residues  and  <7/.  Note  that  if  £(£)  has  a  pole  or  poles  in  C_,  then  some 
of  the  £/s  could  lie  in  CL.  Similarly  for  the  £/’s.  Summarizing  then  we  find 

N 

G(z)  =  -i#(z)^<7y 

r 

G(z)  =  iff(z)£ *1^. 

1=1 

where  0(z)  denotes  the  Heaviside  step  function. 

It  is  very  easy  to  solve  the  Marchenko  equations  for  x  <  0.  Since  G(z)  =  G(z)  =  0  for 
z  <  0,  we  see  that  (2),  (3)  have  the  solutions 

L(jr,  y)  =  L(x,  y)  =  0,  y  ^  x  <  0,  (7) 

and,  due  to  (5),  it  follows  that 


q(x)  =  r(x)  =  0,  x<0.  (8) 

Now  we  consider  the  region  x  >  0.  Then,  since  G(z)  =  G(z)  =  0  for  z  <  0  we  may  take  —y 
instead  of  — oo  as  the  lower  limits  of  integration  in  (2),  (3). 

A  solution  of  these  integral  equations  may  be  given  by  a  closed  system  of  ordinary 
differential  equations  (ODEs).  As  a  first  step  to  achieve  this  goal  it  is  useful  to  replace  y  by 
— y  in  (3),  but  not  so  in  (2).  Since  our  range  of  the  variable  y  is  —  x  <  y  <  +jr,  with  x  >  0 
understood  to  be  a  fixed  parameter,  this  range  does  not  change  when  y  changes  sign.  Then 
the  substitution  of  G  and  G  from  (6)  leads  to 


N 

L(jc,  y)  + 

7  =  1 


L(x.-y)+i 

m  =  1 


(9) 

(10) 


To  solve  these  Marchenko  equations,  note  that  due  to  the  y  dependence  in  the  lower  limits 
of  the  integrals  in  (9)  and  (10),  one  cannot  use  the  usual  method  of  solution  for  an  /V-soliton 
solution,  wherein  one  takes  L(jc,  y)  and  L(x,  y)  to  be  sums  of  separable  functions  of  jr  and  y. 
Let  us  now  introduce  the  two-component  column  functions 


h 


(X,  y)  =  e 


/mU,  y)  =  — 


L(x.s)t  l?r'ds 
L(x,  di 


7=1 . N 

m  =  1 . N 


(ID 

(12) 
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fulfilling 


Ij(X.-X)  =  ,  L(x,X)  = 

(13) 

then  (9),  (10)  may  be  written  as 

N 

L(x,y)  =  \  ^2ojlj(x,y),  L(x,y)  =i 

j=  i 

and  according  to  (5)  the  potentials  are  expressed  as 

N 

Y^aJm(x,-y). 

m  =  1 

(14) 

q(x)  =  lL,(x.x)  =  2iy^g///i(x,-*) 

J 

(15) 

r(x)  =  -2 L2(x.  x)  =  — 2i  ^  omlml{x.  -x) 

(16) 

m 


Differentiation  of  (11),  (12)  with  respect  to  y  then  leads  to  a  closed  system  of  constant 
coefficient  ODEs  for  lj,  7m, 


(By  +  iij)lj(x,y)  =  -L(x.  -v)  =  y ) 

(dy  +  i(m)Im(x,  y)  =  L(x,  y)  =  iYajljix.  y) 


with  x  being  a  parameter.  Let  us  write  ljp<lmp ,  p  —  1, 2  for  the  components  of  ljjm  and 
combine  two  column  vectors 


each  of  them  consisting  of  N  +  N  components.  Now  we  define  an  (N  +  N)  x  (N  +  yV)-matrix 

M, 


M  = 


(Mu 

\M21 


M12\ 

M22) 


with  the  submatrices 

M11  =  diagtCi 


M2:  =  diag(Ci---C/v] 


(19) 

(20) 


M'2 

M21 


N  rows 


N  rows. 


In  this  way  we  find  that  both  column  vectors  ip  fulfil  one  and  the  same  ODE  system, 
ia^Cr,  y)  =  Mip(x ,  y),  p  =  1.2,  <  y  <  x 


which  is  subject  to  two-point  boundary  conditions,  cf  (13), 


(a) 

lj\(x.  -x)  =  1. 

Iml(x.x)  =  0 

(b) 

lj2(x,  -x)  =  0, 

1m  2(X,X)  =  - 

respectively. 


(21) 

(22) 

(23) 

(24) 

(25) 
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The  general  solution  of  this  problem  can  be  given  in  terms  of  the  matrix  exponential 
function 


T(>0  m  exp(i My)  (26) 

so  that  it  holds  generally 

lP(x,  yi)  =  T(y,  -  y2)\P(x,  yi).  (27) 

In  quite  an  analogous  way  to  (19)  T  may  be  divided  into  four  block  matrices.  Then  from  (27) 
together  with  (24),  (25)  we  find 

/;,(*,*)  =  £([ru(2x)rV.  (28) 

j ' 

ln,2(x.  - X )  =  -  ^([r22(-2x)]-')mm..  (29) 

m' 


From  (15),  (16),  we  now  obtain  the  potentials  q(x)  and  r(jc).  Due  to  (27),  (14),  we  now  know 
L  and  L  from  which  one  could  also  construct  the  Jost  functions  0  and  0  [16]. 

With  the  above,  the  question  of  the  existence  of  a  solution  naturally  arises.  We  can  briefly 
address  this  here  by  referring  to  the  known  solutions  of  the  infinite  interval  case.  The  key 
question  is  could  T11  and  T22  ever  be  non-invertible?  As  discussed  in  AKNS  [16],  on  the 
infinite  interval  and  for  the  case  where  r  =  —  q*y  the  solution  always  exists  for  any  finite 
number  of  solitons  and  for  any  set  of  reflection  coefficients,  vanishing  sufficiently  rapidly  as 
£  — ►  oo.  Thus  in  this  case,  it  follows  that  one  could  in  general  expect  that  the  matrices  T11 
and  T22  would  always  exist  and  be  nonsingular.  Similarly,  when  r  =  +q *,  it  follows  that 
if  the  reflection  coefficients  are  always  bounded  by  unity  and  if  there  are  no  bound  states, 
then  it  was  also  shown  that  nonsingular  solutions  for  r  and  q  on  the  infinite  interval  would 
always  exist.  Again  providing  that  those  conditions  were  satisfied,  then  one  would  in  general 
expect  the  matrices  T11  and  T22  to  exist  and  be  nonsingular.  However  if  these  conditions 
are  found  to  not  exist,  then  as  show  n  in  [16],  the  presence  of  even  a  single  bound  state  in  the 
scattering  data  when  r  =  +q *  would  give  a  singular  solution  of  the  form  1  /  sinh  qx.  Whence 
the  solutions  for  r  and  q  would  be  singular  at  some  values  of  x  on  the  infinite  interval.  In  the 
more  general  case,  if  r  and  q  were  generally  unrelated,  the  same  would  still  generally  occur 
(singular  solutions).  It  is  found  that  on  a  finite  or  semi-infinite  interval,  these  singularities 
could  well  occur  only  in  the  nonphysical  regime,  and  then  leaving  nonsingular  solutions  in 
physical  regime.  Thus  except  in  the  case  where  r  =  —<7*,  determining  when  the  solution 
exists  in  the  physical  regime  is  an  open  problem  and  in  need  of  solution 

Let  us  now  discuss  the  problem  of  inversion  about  x  =  +oc  for  the  semi-infinite  case.  As 
long  as  x  >0,  the  reconstruction  process  would  be  exactly  the  same  as  in  the  infinite  interval 
with  one  key  exception.  First,  q(x)  =  r(x)  =  0  for  x  <  0  is  equivalent  to  the  condition  that 
b(t;)  is  analytic  in  C+  and  /?(£)  is  analytic  in  C_.  Note  that  in  general  there  still  could  be  poles 
of  /?(£)  in  C+  and  poles  of  /?(£)  in  C_.  Now  according  to  equations  (4.38a),  (4.38/?)  in  [16] 
we  would  have  to  use  the  reflection  coefficients  p  =  b/a ,  p  =  b/a  instead  of  a,  a.  Then  we 
would  have  to  take  a  contour  C  of  integration  going  above  all  zeros  of  a  but  below  all  poles  of 
b  in  C+  and  a  contour  C  going  below  all  zeros  of  a  but  above  all  poles  ofb  in  CL.  (Following 
the  derivation  outlined  in  [16],  equations  (4.30}-(4.33),  it  is  rather  direct  to  show  that  only 
zeros  of  a  (a)  contribute  to  these  contours,  and  that  the  poles  of  b  and  b  do  not.)  Then  the 
reconstruction  would  proceed  as  in  the  infinite  interval  case. 

However,  if  one  was  using  inversion  about  x  =  +00  and  was  also  interested  in  verifying 
that  the  solutions  for  r  and  q  were  indeed  zero  when  x  <  0,  then  that  proof  would  have  to 
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follow  along  the  lines  of  the  earlier  parts  of  this  section  in  order  to  obtain  the  solution  of 
the  Marchenko  equations  for  the  region  x  <  0.  The  procedure  does  work,  although  much 
more  effort  would  be  required  just  to  rediscover  that  for  a  simple  solution  such  as  a  truncated 
one-soliton  potential,  one  does  have  indeed  r  and  q  vanishing  for  x  <  0. 

In  the  case  of  a  finite  interval,  it  would  not  matter  about  which  direction  one  would  invert 
since  both  ±oo  are  excluded  from  the  finite  interval  and  one  would  therefore  need  to  proceed 
as  in  the  earlier  parts  of  this  section. 

2.2.  Solution  procedure 

Here  we  shall  briefly  summarize  the  preceding  results  into  a  rather  simple  procedure.  We  start 
with  two  rational  reflection  coefficients  cr(£).  &(()  vanishing  at  infinity  and  with  no  multiple 
poles  but  otherwise  arbitrary.  Let  N.  N  be  the  respective  numbers  of  poles.  Then  one  proceeds 
as  follows: 

(1)  Establish  the  matrix  M  from  the  poles  and  residues  of  a(£),  <r(£)  according  to  (19 ) — ( 22 ) 

(2)  Take  the  matrix  exponential  function  r(y)  =  exp[iA/y],  see  (26).  There  are  several 
methods  to  get  this  function.  A  related  discussion  can  be  found,  e.g.,  in  [27].  One  way 
is  to  diagonalize  A/,  compare  subsection  2  4.  Since  the  matrix  exponential  is  usually 
included  in  any  standard  mathematical  software  system  such  as,  e.g.,  Mathematica,  we 
shall  give  no  further  details  here. 

(3)  Invert  the  submatrices  Tn,  F22  to  get  the  functions  lj\(x.  jc),  lm 2(x,  —  v)  from  (28),  (29) 
so  that  the  potentials  q(x ),  r(x)  are  finally  found  according  to  (15),  (16).  The  pointwise 
invertibility  of  these  submatnx  functions  cannot  be  guaranteed  generally  except  in  the 
case  of  r  =  —q*. 

2.3.  A  system  of  PDEs  for  the  auxiliary  functions 

The  two  2  x  (N  +  N)  functions  ljP(x.  y),  Jmp(x%  y),  j  =  1 . N.  m  =  1 . /V,  p  —  1,2 

defined  by  ( 1 1 ),  ( 1 2)  and  combined  to  form  two  (N  +  /V)-vec tors  \\ ,  I2  in  ( 1 8)  do  play  a  key 
role  in  the  procedure  developed  above.  They  are  defined  in  (1 1 ),  (12)  as  Fourier-type  integrals 
over  the  functions  Lp%  Lp ,  p  =  1,2,  which  were  introduced  in  [16].  Vice  versa  Lp ,  Lp  are 
expressed  as  sums  over  ljPJmp  with  constant  coefficients,  see  (14).  Here  we  will  derive  a 
system  of  PDEs  for  lj,  L  which  resembles  such  a  system  for  Lp ,  Lp  known  from  [16],  and, 
indeed,  the  latter  can  be  rediscovered  from  the  former.  We  believe  that  it  might  be  of  interest 
to  point  out  this  nontrivial  structure. 

Obviously,  together  with  \p(x.  y),  p  fixed,  each  of  the  column  vectors  3*1^,  (3r  ± 

3^)1^  fulfils  the  ODE  system  (23)  with  the  same  matrix  M  for  any  fixed  x.  In  particular,  let  us 
take 


J  = 


N,  m  =  1 . ft. 


(30) 


Due  to  (24),  (23)  and  with  using  (15)  we  find  boundary  values 
hj(xH  —x)  =  0 


(31) 


j 


From  a  comparison  of  (31 )  with  (25)  we  conclude  h(jr,  y)  =  ^(x)l2(.t,  y). 
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Quite  similarly  we  may  define 


f  (A,y) 


(dx+dy)\2(. *,y) 


to  get 


fj(x,  -a)  =  2dylji(x*  y)  =  — 2i  Vam/m2(A,  -a)  =  r(or) 
fm(x*  *)  =  0. 


(32) 


(33) 


From  (33)  compared  to  (25)  we  get  f(jc,  y)  =  r(A)li(A,  y).  Putting  together  we  may  write 
down  the  relations 


(3,  -3>)liU,y)--?(A0l2(-v,y)  =  0, 
Or  +  3>)l2<A,  y)  -  r(A)l,(A,  y)  =  0. 
Due  to  ( 1 4)  from  (34)  we  get  the  PDEs 


(34) 


(dx  +  3 y)Ll(x,y)  -q(x)L2(x,y)  =  0 

(3*  -3QL2(A,y)-r(*)Li(*,y)  =  0 

-  (35) 

Or  -3>)Li(a,  y)  -q(x)L2(xsy)  =  0 

(3r  +  dy)L2(x,  y)  —  r(x)Li(x,  y)  =  0 

corresponding  to  equations  (4.46a),  (4.46b)  in  [16],  as  well  as  the  conditions  for  consistent 
boundary  values 


L j(a,  —a)  =  L2( a,  -a)  =  0  (36) 

L2(a,  a)  =  -r(A)/2,  Lj(a,  a)  =  <7(a)/2.  (37) 

As  one  can  see  in  (35),  L\  and  L2  propagate  along  the  characteristics  a  ±  y.  Whence  if 
q  and  r  were  suitably  bounded  on  the  interval  of  interest,  then,  as  in  [16],  it  would  follow  that 
L\  and  L2  would  exist. 


2.4.  The  characteristic  equation 


By  considering  the  characteristic  equation  of  the  matrix  M,  we  can  relate  our  results  to  the 
procedure  developed  by  Rourke  and  Morris  [14].  This  equation  is 

P(0  =  det(Af  —  <■/)  =  0  (38) 


where  /  denotes  the  unit  matrix  of  order  N  +  N.  Let  us  first  give  a  reformulation  of  the 
characteristic  polynomial  P(f).  We  divide  the  j  th  column  of  the  matrix  ( M  —  £/)  by  (£  —  (j) 
and  the  (N  +  k) th  by  (£  —  {*)  so  that  we  may  write 


P(0  = 


N 


n<o-o 


N 


m-n 

k=l 


det  W. 


Here  W  denotes  a  matrix  consisting  of  four  block  matrices. 


W  = 


(39) 


(40) 
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such  that  Is  and  Is  are  unit  matrices  of  orders  N  and  N,  respectively,  while  B  is  an  N  x  N- 
matrix  with  N  identical  rows  (6j, . . . ,  bs)  and  C  is  an  N  x  N-matrix  with  N  identical  rows 
(ci . cs)  with 

bk  =  0k/(S  -ft).  cj  =  -OjH(  —  0).  (41) 

Obviously,  according  to  the  presumptions  given  above,  the  reflection  coefficients  a,  a  may  be 
written  as  the  sums 

N  N 

<t(C)  =  -]Tc7,  =  (42) 

7-1  k= 1 

Due  to  the  lemma  given  in  the  appendix  together  with  (41),  (42)  we  find 

det  W  =  1  +or(£)tf(£).  (43) 

It  is  obvious  that  the  zeros  of  (43)  are  eigenvalues  of  the  matrix  M.  However,  not  all  eigenvalues 
of  M  need  to  be  zeros  of  (43).  It  may  happen  that  d(£)  could  just  accidentally  vanish  at  one 
of  the  poles  in  <t,  say  at  £  =  f*.  In  such  a  case,  in  the  product  ad ,  the  corresponding  pole  in 
a  would  be  canceled  and,  due  to  the  factor  (C  —  <*)  extracted  in  (39),  this  value  of  would 
become  an  eigenvalue  of  M.  More  generally:  when  some  of  the  poles  in  a  and  in  d  are 
canceled  in  the  product  ad  then  the  corresponding  subset  of  {£;‘,  (j)  together  with  the  zeros 
of  1  +  ad  is  just  the  eigenvalues  of  M. 

Rourke  and  Morris  [14]  determined  the  eigenvalues  of  the  completed  /V-soliton  states  as 
the  zeros  of  1  +ad  for  the  case  r  =  —  </*,  in  which  case  <7(0  =  <?*(£*)  (sec  (55),  (59)).  Thus 
in  their  treatment,  they  found  it  necessary  to  exclude  such  accidental  cancelations.  These 
cases  of  accidental  cancelations  were  later  handled  by  Rourke  and  Saunders  [28]  However 
one  should  note  that  no  such  difficulty  occurs  in  our  procedure  as  described  in  subsection  2. 1 
and  continued  below. 

3.  Reductions 

In  physical  applications,  the  general  AKNS  system  typically  is  simplified  by  one  or  another 
symmetry.  Upon  taking  (a)  r  =  ±q  or  (b)  r  =  ±q*  we  arrive  at  a  Zakharov-Shabat  problem. 
Here  of  course,  we  would  have  N  =  N .  The  respective  reductions  for  these  symmetries  are 
already  given  in  [16].  However  here  we  are  more  interested  in  the  consequences  that  this  has 
for  the  procedure  developed  in  the  foregoing  section. 

(a)  For  r  =  cq,  €  —  ±1  it  holds 

<7«)  =  -e<x(-<),  G(z)  =  €G(z)  (44) 

and  L2  —  eLi,  Li  =  —  L\  There  are  2 N  poles  £>,  lj>  j  —  1 . N,  and  we  will  assume 

here  that  they  all  are  different  from  one  another.  (A  degenerate  set  of  poles  would  require 
a  separate  treatment.)  These  poles  may  be  ordered  such  that 

lj  =  -<y‘«  dj  =  —€Oj.  (45) 

Due  to  (1 1 ),  ( 1 2)  we  obtain 

lj\ (a\v)  =  elj2 (x,  -y),  lj2(x ,  y)  =  -/>i(.v,  -y).  (46) 

The  2N  x  2N-matrix  M  has  the  properties 

M22  =  -M".  M21  =  -f  MIJ.  (47) 
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By  the  use  of  the  2N  x  2/V-matrices 

«-c.  i) 

with  0  and  1  denoting  the  N  x  N  null  matrix  and  the  N  x  N  unit  matrix  respectively,  the 
symmetry  (47)  of  M  can  be  characterized  by  the  relation 

EME~'=~M.  (49) 

For  any  natural  number  n,  it  holds  EMnE~i  =  Consequently,  the  matrix 

exponential  function  (26)  has  the  symmetry 

ET{y)E-'  =  H— y).  (50) 

If  X  j  is  an  eigenvalue  of  M,  then  —  k\  is  an  eigenvalue  as  well.  We  shall  assume  that  there 
are  no  degenerate  eigenvalues. 

It  then  follows  that  there  are  2N  eigenvalues  A.j, . . . ,  k2s  which  may  be  ordered  such  that 
Xj+N  =  —Xj.  Let  D  be  the  diagonal  matrix 

D  =  diag[Xi,X2...X2iv].  (51) 

There  exists  a  2N  x  2N-matnx  A  such  that 

AM  —  DA.  (52) 

The  rows  of  A  are  the  eigenvectors  of  M.  Multiplying  (52)  by  E  from  the  left  and  by  E  1 
from  the  right,  using  (49)  and  the  obvious  relation  EDE~X  =  —  D,  we  hnd  that  in  (52) 
A  may  be  replaced  by  EAEX  or,  as  well,  by  a  linear  combination  of  both  matrices.  In 
particular  we  may  replace  A  by  A  =  A  —  EAE~X  fulfilling  the  symmetry  EAE~X  =  -  A. 
Thus,  without  loss  of  generality  we  may  require  the  diagonalizing  matrix  A  to  fulfil  the 
same  symmetry  as  M ,  cf  (49),  namely 

EAE~'  =  -A.  (53) 

(b)  For  r  =  €q*  ,€  =  ±  lit  holds  that 

=  -€o*(0).  G(z)  =  -e<T(z),  (54) 

and  Li  =  eLJ,  L2  =  —  L*.  The  poles  may  be  ordered  such  that 

Ij  =  Cf,  oj  =  -€crf.  (55) 

lj\(x*y)  =e7y2(x*,-.y),  lj2(x<  -y)  =  -/y,Cv,  y).  (56) 

The  symmetry  of  the  matrix  M  is  characterized  by 

EME  ]  =  M*  (57) 

with  the  same  matrix  E  defined  by  (48),  and  in  quite  an  analogous  way  as  for  (a)  we  find 
the  symmetry  of  T, 

£r(y)£~*  =  r*(— V).  (58) 

With  the  specification  (54)  the  equation  (43)  reads 

det  W  =  1  -  6<t(£)<t*(C*).  (59) 

For  6  =  —1  the  equation  det  W  =  0  corresponds  exactly  to  an  equation  found  in  [14], 
which  is  located  in  the  text  below  their  equation  (24).  In  comparing  these  equations  we 
only  need  to  keep  in  mind  (a)  that  their  variable  r  corresponds  to  our  variable  *  and  (b) 
that  they  are  considering  a  pulse  vanishing  for  r  >  0  while  we  take  the  potential  to  vanish 
for  x  <  0. 
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4.  Computing  all  scattering  data  from  the  reflection  coefficients 


It  has  been  demonstrated  in  appendix  5  of  [16]  how  one  can  determine  all  scattering  data  from 
the  so-called  primordial  scattering  data.  In  particular,  for  a,  a  being  rational  functions  of  £ 
and  regular  at  infinity  together  with  the  requirement  that  b (£)  (6(£))  is  analytic  in  C+(C_ ),  we 
know  the  zeros  of  a(a)  in  C+(C_)  and  so  we  are  able  to  determine  a ,  b ,  a,  b  explicitly  by  the 
use  of  this  procedure.  Particular  symmetries  such  as  discussed  in  section  3  would  simplify  the 
procedure.  Instead  here,  more  generally,  we  will  require  a  minimal  symmetry.  It  will  be  seen 
that  this  symmetry  is  a  sufficient  condition  for  the  R&M  method  of  construction  to  work. 

First  we  assume  N  =  N,  i.e.,  that  there  are  as  many  poles  £y,  j  =  1 . /V  of  a  as  there 

are  poles  lj  of  <r(f).  Moreover  we  assume  that  for  any  fixed  j  the  points  ?y,  fy  are  in  different 
half-planes  C+,  C_.  E.g.,  for  the  reductions  (a)  r  —  ±q  or  (b)  r  =  ±q*  with  taking  £y  =  —  fy 
or  fy  =  f  ?,  respectively,  these  conditions  are  fulfilled.  Note  that  this  minimal  symmetry  is 
not  a  condition  on  the  potentials  but  on  the  reflection  coefficients  a,  6 . 

We  define  two  functions  /,  /, 

no -a n,.  7(0* a/ n,.  n,  ■  f[  (60) 

Clearly,  log  /  is  analytic  in  C+  while  log  /  is  analytic  in  C_,  and  both  functions  vanish  at  least 
as  fast  as  l/<  as  |£|  — ►  oo.  Now'  we  take  the  integral  over  log [/ «■') / (%')]/($'  —  f)  along  the 
real  £'-axis.  This  is  a  sum  of  two  integrals.  For  the  term  containing  log  /  (log  /)  the  contour 
of  integration  may  be  closed  in  C+(C_).  Making  use  of  the  relation 

a(^)a(0  +  =a(Oa(Oi  1  +  <T(f)a«)]  =  1  (61) 


we  arrive  at 

-log/«).  cec+ 

log  flO.  CeC_.  1  ’ 

The  product  a(£)a(£)  has  2 N\  ^  2N  poles.  This  means  that  o  contains  (N  —  N\) 

multiplicative  factors  of  the  type  (£  —  &)/(£  —  *  =  N \  +  1 . N ,  which  are  canceled 

in  the  product  by  the  reciprocal  factors  contained  in  6 .  There  are  2/Vj  zeros  of  1  +  <7(£)<t(£), 
say  Ay,  A/,  /  =  1 . N\ ,  with  an  as  far  arbitrary  order.  Thus  we  may  write 

Ni 

1  +<r(f)a(C)  =  |~[ 
l=i 


(X  -  X,)(£  -  A() 

_(<-£)(?  ~h)  _ 


(63) 


1 

2  71 


d£' 

7  /  l0g[l+a(^)<7(r)]-T37  = 

*  J -00  s  s 


Now  we  substitute  (63)  into  (62).  It  is  helpful  to  write  down  the  subsequent  formula  which  is 
easily  verified  by  partial  integration  and  then  closing  the  contour  in  C_.  For  f  €  C+  it  holds 
that 


s  —  z\  1  ds 
s  -Z2js  -t; 


log[1-2l/0. 

Z|  €  C_. 

z2  e  C+ 

-log[i  -  z2/0. 

Z|  €  C+, 

Z2  €  C_ 

log[(f  -  Z|)/(C  -Z2)]. 

Zl  €  C_, 

Z2  €  C_ 

0, 

Zl  €  C+. 

Z2  €  C+. 

(64) 


The  denominator  in  (63)  has  exactly  N\  zeros  in  C+  and,  as  well,  N\  zeros  in  C_. 
Then  the  same  must  hold  for  the  numerator  because  otherwise — according  to  (64) — the 
integration  would  lead  to  a  singularity  at  f  =  0.  Without  loss  of  generality,  we  may  take 
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Xi  e  C+,  k(  e  C_,  /  =  1 . N\.  We  introduce  two  discrete  functions  c+,  Z-  defined  on  the 

set  (j*  (j*  j  =  1 . N,  as  follows: 

if  €  C± 

if  €  C±. 


z±(Sj)  =  z±(Xj)  = 


(65) 


Choosing  £  €  C+  we  find 


/=1 

and  combination  with  (60)  leads  to 

N\ 


a(<)=ri73 

i=i  s 


n 


C  -ft 
<-ft' 


(66) 


(67) 


Note  that  this  formula  does  not  distinguish  which  of  the  £/,  /  =  I . N\,  is  in  C+  and  which 

in  C_.  Afterward  we  may  even  admit  values  ft  on  the  real  axis.  In  an  analogous  way  we  find 


ri 

/= i  s  1  {*>*,,<* 


(68) 


5.  The  completed  iV-soliton  potential 


In  this  section  again  we  assume  what  we  have  called  minimal  symmetry  in  the  foregoing 
section.  We  may  combine  the  potential  qR(x)  =  q(x )  vanishing  for  x  <  0  with  a  potential 
qi(x)  vanishing  for*  >  0.  The  respective  scattering  matrices  are  defined  as 


Sr(0  = 


(69) 


and  the  scattering  matrix  Sc  of  the  combined  potential  is  just  the  product 


( ac 


Sr  Si  = 


0  Rfl  i  ~  ^R^L 

Hr^l  +bRai 


aRf>L  +  ^R^l 
&R&L  ~  t>R^L 


(70) 


Sr  and  Si  have  to  be  unimodular  matrices,  i.e.. 


( aa  +bb)R  =  ( aa  +bb)i  =  1. 


(71) 


Note  that  aR<  bR, . . .  here  are  the  same  as  a,  b _ outside  this  section.  Below  we  use  the 

notation,  cf  [16], 

p  =  b/a,  p  =  b/a,  a  =  b/a ,  6  =  b/a  (72) 

with  subscripts  R  or  L  to  be  added  From  appendix  5  in  [16]  we  know  that  aR,bR,ai,  bi  are 
analytic  functions  in  C+  while  or,  />/?,  di,bi  are  analytic  functions  in  C_. 

Now  we  require  that  the  combined  potential  should  be  reflectionless,  i.e., 
bc  =  be  =  0  or 


Pl  —  — <7/?.  Pi  —  —or.  (73) 

From  (73)  together  with  (7 1 ),  (72)  we  get 

a  [  _  _  a  i  uR 

ac  —  aRai(  1  +  oRoR)  =  —  (aRaR  +bRbR)  =  —  =  — .  (74) 

aR  aR  <ii 
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From  (67),  (68)  we  know  or,  or,  and,  clearly,  from  knowledge  of  Pl(0  due  to  (72)  we  may 
determine  aL,  at  by  a  procedure  quite  analogous  to  that  used  in  section  4.  In  particular  we  get 

N\  „  _ 

n 


"■w-TIt 

;= i  s 


K  -Jl 

i-t 


\k>N,,(teC.) 


j  -ft 
<-ft' 


(75) 


When  now  we  identify 

h  =  z+(ft). 

then,  due  to  (74),  we  arrive  at 


i-k  =  Z-(ft). 


MO  =  aL/aR  =  |~] 
j- 1 


k  =  Ni  +  \ . N 


ac  =  1  /oc- 


(76) 


(77) 


This  is  the  typical  structure  of  a,  a  for  an  /V-soliton  state. 


Conclusion.  We  see  that  under  the  assumption  of  minimal  symmetry,  the  considered  right 
potential  (vanishing  for  x  <  0)  can  be  combined  with  a  left  potential  such  that  the  combined 
potential  is  reflectionless,  i.e  ,  it  is  an  N-soliton  potential.  This  statement  generalizes  the 
corresponding  results  in  [14,  28]. 

Combining  these  results  with  (38),  (43),  (63)  we  see  that  the  zeros  of  ac,  j  —  1 . jV, 

together  with  the  zeros  kj  of  ac  are  the  eigenvalues  of  our  key  matrix  Af,  while  at  the  same 
time,  they  are  also  just  the  eigenvalues  of  the  completed  /V-soliton  potential. 


6.  Examples 

We  shall  now  present  a  few  examples  of  solutions  constructed  by  the  application  of  the 
procedure  outlined  in  subsection  2.2.  Here  N  and  N  will  be  restricted  to  be  equal  to  2  at  most, 
in  which  case  the  resulting  potentials  can  be  given  in  analytical  form. 

Example  1.  The  simplest  example  is  to  take — in  the  case  of  no  symmetry  reduction — 
yv  =  1 ,  N  =  0.  The  ‘matrix*  M  then  has  one  element  only, 

M  =  Af"=ti.  (78) 

Consequently  we  have 

r(jt)  =  rn(.v)  =  exp[if,.t],  (79) 


/n(*.  *)  =  expl— 2ifi*], 


(80) 


q(x )  =  2\0(x)oi  exp[-2i£i  *],  r(x)  =  0. 


(81) 


We  have  to  choose  the  pole  fi  €  C_  in  order  to  obtain  an  exponentially  decreasing  potential. 
It  is  easy  to  solve  the  direct  scattering  problem  for  the  potentials  (81).  Using  scattering  data 
a,  b,a,b  in  AKNS  notation  we  find 


a  —  a  =  1 , 


b  =  0, 


b  = 


°\ 


C-Ci 


and  consistently  we  recover  the  reflection  coefficients 


b  o  i 

or(C)a-  =  — i- 
a  f  -  l 


0(0  s  — 
a 


0. 


(82) 


(83) 


Since  the  coefficients  a  and  a  have  no  zeros,  this  solution  has  no  solitons.  Thus  it  cannot  have 
a  completion.  Whence  this  demonstrates  that  there  are  scattering  data  for  which  a  completion 
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cannot  be  done.  However,  this  system  also  does  not  have  the  miminal  symmetry  introduced 
above. 


Example  2.  We  choose  reduction  (a)  of  section  3  and  take  N  =  N  =  1,  ft  =  —  ft,cr|  = 
— €<j\ ,  6  =  zb  1  to  obtain  the  matrices 


M  = 


-ecrA 
"Cl  ) 


(84) 


T(y)  =  exp[iA/y]  =  1  cosh(/Xiy)  +  — M  sinh(//|  v) 

Mi 


with 


a  i  =  sH  - fa*- 

According  to  (28),  (29),  (45)-(50)  we  hnd 

/it<jc.jr)  =  [r"(2jc)r‘.  7,2(x.  -X)  =  -[r22(-Zr)]-'  =  — jc) 


(85) 

(86) 
(87) 


2\Q(x)oi 


q(x)  er(x)  cosh(2/xi jc)  +  (ift/Mi)  sinh(2/X|X) 

/x  i  must  not  be  purely  imaginary  in  order  to  get  an  exponentially  decaying  potential. 


(88) 


Example  3  (cf  [12]).  Now  we  choose  reduction  (b)  of  section  5  and  take,  in  particular, 
N  =  N  =  1,  ft  =  ft  +  ir/j,  ft  =  ft  —  ir/i,  q \  real,  6\  =  — eo jft  6  =  zbl  to  obtain  the  matrices 


with 


M=(S'+irl'  ~(aJ  V  M0=('m  ~fa<) 

\  0\  ft-i>?i/  \-a,  -irii ) 


P(v)  =  exp[iAfv]  =  e1 


,^i>' 


j^l  cos 


cosh(/ri  v)  +  —  A/o  sinh(/C|  v) 


] 


yjrii-e\o  ll2 


(89) 


(90) 


(91) 


Then  equations  (28),  (29)  hold  as  before,  and  due  to  (15),  (16),  we  hnd  the  potential 

2i0(x)r7i  exp[— 2iftjc] 


q(x)  =  €r*(x) 


cosh(2/rtx)  —  (rji/K\)  sinh(2/^ix) 


(92) 


Obviously,  for  €  =  —  1  the  potential  is  nonsingular  since  we  will  have  |r/i|  <  |xi|.  W'ithout 
loss  of  generality,  we  may  take  the  branch  in  (91)  to  be  positive,  i  e.,  k \  >  0,  in  which  case  it  is 
a  soliton  truncated  at  x  =  0.  If  r)\  >  0  then  there  is  a  maximum  of  |<7|  at  a  point  x  =  jcq  >  0. 
If  t?i  <  0  the  untruncated  soliton  has  its  maximum  value  at  a  point  x  =  xq  <  0.  We  note  that, 
quite  generally,  a  pole  of  a(£)  in  the  lower  half-plane  corresponds  to  a  soliton  with  its  center 
being  in  the  unphysical  region.  This  has  been  called  a  virtual  soliton  [9]  It  is  easy  to  see  that 
the  eigenvalues  of  the  matrix  M  in  (89)  are  ft  ±  i/ci,  and  that  these  are  just  the  eigenvalues  of 
the  completed  soliton. 

For  €  =  +1  we  must  introduce  the  constraint  \o\  \  <  \q\  \  in  order  to  have  real  K\.  Then  for 
q i  <  0,  the  potential  is  nonsingular  due  to  the  0  function  However,  the  untruncated  potential 
will  have  a  singularity  in  the  unphvsical  region ,  x  <  0.  If  q \  >  0,  then  we  will  have  a  singular 
soliton. 

We  note  that  in  all  three  examples,  cf  equations  (81),  (88),  (92),  it  holds  that  the  potential 
is  discontinuous  at  the  truncation  point  and  the  discontinuity  is  just  the  residue.  Ox,  multiplied 
by  2i  [12],  i  e., 


q  (+0)  =  2iai. 


(93) 
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Figure  1.  Example  4.  Truncated  two-soliton  potential  gU).  The  truncated  left-hand  pan  is 
indicated  by  the  dashed  line.  C  =  —  1.5i.  ft  =  2i,  ft  =  *2  =  3f.  k\  =  2.5i 


Example  4.  Choosing  the  reduction  r  =  —  q*  %  a(£)  =  *)  we  take 


o(0  =  C 


a  i  a  2 

K  -  i\  +  }  -  Ti 


(94) 


with 


o\  =  C 


<2  -  £T 

ft  -  ft  ’ 


02  = 


C£l^L 

Cl  -  <2  ' 


(95) 


It  is  the  particular  feature  of  this  example  that  o  does  contain  a  factor  (f  —  )/(£  “  ft)  which 

is  canceled  in  the  product  06  hy  the  reverse  factor  contained  in  6 .  Now  we  specify  C,  ft ,  ft 
as  purely  imaginary,  ft  =  iij|,  ft  =  ir?2-  Then  the  matrix  Af,  cf  (19),  is 


<  i//i 

0 

«f 

°2  > 

0 

Of 

~0\ 

—°2 

-i  ’h 

0 

\-Ol 

-O7 

0 

-■  m) 

Further  specification  C  =  —  1.5i,  q  1  =  2,  r; 2  =  3  leads  to  eri  =  7.5i,  02  =  — 9i.  Then  the 
eigenvalues  of  the  matrix  M  are  found  to  be  ft  =  2.5i,  ft  —  3i,  -ft  =  — 2.5i,  ft  =  —  3i.  Steps 
(2)  and  (3)  of  the  procedure  are  done  by  a  Mathematica  program  which  directly  yields  the  real 
potential 


q(x)  —  —330 


e5jr ( 1 00  —  1 08e r  -  12elu  +9e,2jf) 

900 +"  1 2 1 00c l(k  -  21600eiT7  + 9801e1-r  +  9e2Zr  ’ 


x  >  0 


(97) 


as  is  depicted  in  figure  1.  This  is  an  example  where  one  of  the  eigenvalues  of  the  truncated 
potential  coincides  with  an  eigenvalue  of  the  completed  two-soliton  potential,  cf,  the  discussion 
in  the  final  part  of  subsection  2.4. 


Example  5.  Now  we  assume  r  =  q*  and  take  ft  =  — 2i,  ft  =  — 4i,  both  in  C_,aj  =  6\  = 
i,a2  =  02  —  — i.  Again  the  computer  program  yields  a  potential  in  analytic  form,  as  a  quotient 
of  sums  over  exponential  functions.  The  respective  formula,  however,  is  not  well-suited  to  be 
printed.  The  resulting  potential  has  two  singularities  in  the  unphysical  region  x  <  0  but  no 
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Figure  2.  Example  5.  Two  virtual  solitons.  r  =  q* .  Ci  =  — 2i.  C2  =  — 4i,  o\  =  6\  =  i.<72  = 
62  —  — i,  X\  =  4.040i,  X2  =  1 .9 1 7i.  k\  =  A.*,  X2  =  Xj.  Note  the  different  scales  for  the  ordinates 
left  and  right. 


singularities  on  the  right-hand  side,  see  figure  2.  This  is  what  one  would  call  a  two-virtual 
soliton  solution 


7.  Summary  and  discussion 

We  have  developed  a  method  to  solve  the  inverse  scattering  problem  for  an  AKNS  problem 
with  given  rational  reflection  coefficients  cr(£)  =  £(£)/a(0,  <?(?)  =  on  a  finite  or 

semi-infinite  interval.  The  requirements  for  the  implementation  of  this  solution  are  that  the 
matrix  exponential  function  of  a  constant  matrix  A/,  of  order  N  +  N,  has  to  be  computed  and 
two  matrices,  T11  and  T22,  of  orders  N  and  N  respectively,  must  be  inverted.  This  must  be 
done  for  all  points  at  which  the  solution  is  desired 

With  restriction  to  so-called  minimal  symmetry,  we  have  shown  that  the  reflection 
coefficients  are  consistent  with  a  right  potential,  i.e.,  vanishing  for  x  <  0,  and  that  a  left 
potential  can  be  added  so  that  the  combined  potential  is  an  N-soliton  state.  The  eigenvalues 
of  the  combined  potential  are  just  the  eigenvalues  of  the  matrix  M. 

It  has  been  pointed  out  already  in  the  introduction  that  the  procedure  presented  here  is  of 
importance  for  solving  initial-boundary  problems  by  the  use  of  the  inverse  scattering  method. 
Indeed  such  an  application  already  has  been  done  for  the  second  harmonic  generation  which 
is  mathematically  equivalent  to  degenerating  two-photon  propagation  in  the  low-excitation 
limit  [10,  11].  There  one  has  to  select  a  number  N  out  of  an  infinity  of  poles,  and  it  has  been 
demonstrated  in  the  quoted  papers  that  with  growing  N  the  approximation  of  the  solution  for 
an  initial-boundary  problem  becomes  better  and  better.  As  shown  in  the  above  references, 
one  can  obtain  solutions  for  N  as  large  as  even  60  and  depending  on  the  software,  one  could 
even  go  higher  Compared  to  [10,  1 1]  the  solution  method  here  is  essentially  generalized  and 
presented  in  a  more  systematic  form  so  that  it  is  well-suited  for  further  applications. 
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Appendix.  The  determinant  of  W 

Let  us  start  from  some  matrix  W  of  a  structure  as  it  is  given  by  equation  (40)  (subsection  2  4). 


It  consists  of  four  block  matrices, 

"'-(c  *)•  (AI) 

such  that  Im  and  ln  arc  unit  matrices  of  orders  m  and  n%  respectively,  while  B  is  an  m  x  /i-matrix 

with  m  identical  rows  (b\ . bn)  and  C is  an  n  x  m -matrix  with  n  identical  rows  (c i . cm). 

We  say  that  W  is  of  the  type  (m,  n). 

Lemma.  The  determinant  of  this  tnatrix  is  found  as 

det  W  =  I  -  (A  2) 


Proof  by  induction. 

(a)  Obviously,  the  statement  is  true  for  m  =  n  =  1. 

Then  we  have  to  prove 

(b)  It  is  true  for  (in,  n),  n  >  1  provided  it  holds  for  (m,  n  —  1). 

(c)  It  is  true  for  (m,  «),  m  >  1  provided  it  holds  for  (m  —  1,  n). 

In  order  to  prove  the  claim  (b)  we  take  the  matrix  W  and  subtract  the  row  immediately 

above  the  bottom  row  from  the  bottom  row,  so  that  the  latter  then  reads  (0 . 0,-1,  1 ). 

Next  we  add  the  right-most  column  to  the  one  next  to  it,  so  that  it  becomes 

[(*„_!  +£»„) . (*>„_ i +*>»). 0...0.  I.0]r.  (A. 3) 

and  the  bottom  row  then  is  simply  (0 _ ,0,  1).  Thus  the  determinant  of  W  is  equal  to  the 

determinant  of  a  matrix  W  of  type  (m,  n  —  1)  with 

bj  =  bi ,  i<n-  1,  hn-\=bH-\+bn%  ck  =  q,  (A. 4) 

and  point  (b)  is  true  simply  because  XT  1  bj  =  XT  bj. 

In  quite  an  analogous  way  the  claim  (c)  can  be  proven  so  that  the  Lemma  is  true  for  all 
natural  numbers  m,  n. 
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A  special  class  of  multicomponent  NLS  equations,  generalizing  the  vector  NLS 
and  related  to  the  BD.I-type  symmetric  are  shown  to  be  integrable  through 
the  inverse  scattering  method  (ISM).  The  corresponding  fundamental  analytic 
solutions  are  constructing  thus  reducing  the  inverse  scattering  problem  to  a 
Riemann- Hilbert  problem.  We  introduce  the  minimal  sets  of  scattering  data 
T  which  determines  uniquely  the  scattering  matrix  and  the  potential  Q  of  the 
Lax  operator.  The  elements  of  X  can  be  viewed  as  the  expansion  coefficients  of 
Q  over  the  ‘squared  solutions1  that  are  natural  generalizations  of  the  standard 
exponentials.  Thus  we  demonstrate  that  the  the  mapping  X  — *  Q  is  a  general¬ 
ized  Fourier  transform.  Special  attention  is  paid  to  two  special  representatives 
of  this  MNLS  with  three-component  and  five  components  which  describe  spinor 
(F  =  1  and  F  =  2,  respectively)  Bose-Einstein  condensates. 

Keywords:  Bose-Einstein  condensate,  Integrable  systems,  Recursion  operator, 
squared  solutions 

1.  Introduction 

Bose-Einstein  condensates  (BEC)  with  the  internal  degrees  of  freedom, 
the  so-called  spinor  BEC  have  attracted  much  attention  experimentally 
and  theoretically  in  recent  years.  Spinor  BEC  opens  up  a  new  paradigm, 
where  the  order  parameter  of  condensates  is  described  by  a  multicom¬ 
ponent  vector.  This  situation  can  be  realized  by  optically  trapping  cold 
atoms  where  all  hyper  fine  states  are  liberated,  while  magnetic  trapping 
freezes  its  freedom.  So  far  23 Na  (the  hyperfine  state  F  =  1),  and  87 Rb 
(F  =  2)  are  extensively  investigated,  see1-12  and  the  references  therein. 
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We  consider  BECs  of  alkali  atoms  with  hyperfine  structure  elongated  in  x 
direction  and  confined  in  the  transverse  directions  y,z  by  purely  optical 
means.  Thus  the  assembly  of  atoms  in  the  hyperfine  state  of  spin  F  can 
be  described  by  a  normalized  spinor  wave  vector  with  2 F  -f  1  components 
<&(x,t)  =  ($^(2,  £)»  ®f-i  . . . ,  £))t  whose  components  are  la¬ 

beled  by  the  values  of  rriF  =  F, . . . ,  1, 0,  —  1,. . . ,  —  F. 

The  main  tool  for  investigating  BEC  is  the  Gross-Pitaevski  (GPE)  equa¬ 
tion.  In  the  one-dimensional  approximation  the  GPE  system  goes  into  a 
multicomponent  nonlinear  Schrbdinger  (MNLS)  equation  in  ID  x-space:6 

.3$  SEqp  [$] 

(1) 

where  for  F  =  1  the  energy  functional  is  given  by: 

Egp  =  J  dxl^^\dx&\2  +  c[l$i|4  +  l$-i|4  +  2|4>0|2(|$i|2  +  |$_,|2)] 

+  (c0  -  c2)|<J>1|2|$_,|2  +  f|^o|4  +  ^(*1*!.^  +  <^2$,$-i)}(2) 

The  effective  ID  couplings  c  =  (cq  4-  02)/^,  do  =  co/2a  1,  c2  =  c2/2a5_, 
where  a±  is  the  size  of  the  transverse  ground  state.  In  this  expression, 
cQ  =  irh2{ciQ  +  2a2)/3m,  C2  =  irh2(a2  —  ao)/3m,  where  a/  are  the  s-wave 
scattering  lengths  for  the  channel  of  total  hyperfine  spin  /  and  m  is  the 
mass  of  the  atom.  We  consider  special  (integrable)  choice  for  the  coupling 
constants  Co  =  C-2  =  —  c  <  0.  This  situation  corresponds  to  attractive  mean- 
field  interaction  and  ferromagnetic  spin-exchange  interaction.  We  will  use 
dimensionless  form:  \/24>o,4>_i}r,  where  time  and  length  are 

measured  respectively  in  units  of  t  =  ha±/c  and  x  =  hyja±/‘2mc. 

For  F  =  2  the  energy  functional  is  defined  by1  3,5,11 

-  £ dI  + t"! + ?f! + ?'e|2)  ■  <3> 

where  e  =  ±1.  The  number  density  n  and  the  singlet- pair  amplitude  0  are 
defined  by3,5,11 

2 

71  =(&.&*)=  ^2  0  =  (i,  So^)  =  2^2^-2  -  4-  ^q. 

Q  =  — 2 

The  coupling  constants  Cj  are  real  and  can  be  expressed  in  terms  of 
a  transverse  confinement  radius  and  a  linear  combination  of  the  s-wavc 
scattering  lengths  of  atoms6,710  and  f  describe  spin  densities.11  Choos¬ 
ing  C2  =  0,  c\  =  1  and  cq  =  —2  we  obtain  an  integrable  version  of  the 
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corresponding  GPE  equation  by  means  of  the  inverse  scattering  transform 
method.6,13 

The  aim  of  present  paper  is  to  show  that  both  GPE  energy  function¬ 
als  (2),  (3)  correspond  to  integrable  MNLS  models  related  to  symmetric 
spaces14  of  BD.I-type:  ~  SO(n  +  2)/SO(2)  x  SO(n)  with  n  =  3  and  n  =  5 
respectively.  In  Section  2  we  formulate  the  Lax  representations  and  the 
explicit  form  of  MNLS  models,  generalizing  the  vector  NLS  for  any  n.  In 
Section  3  we  construct  the  fundamental  analytic  solutions  of  the  corre¬ 
sponding  Lax  operator  L  and  reduce  the  inverse  scattering  problem  (ISP) 
for  L  to  a  Riemann- Hilbert  problem  (RHP).  We  also  introduce  the  minimal 
sets  of  scattering  data  Xt  each  of  which  allow  one  to  reconstruct  both  the 
scattering  matrix  T( A)  and  the  corresponding  potential  q(x,  t).  In  Section  4 
we  explain  that  the  ISM  for  this  class  of  Lax  operators  can  also  be  consid¬ 
ered  as  a  generalized  Fourier  transform.  To  this  end  we  use  the  expansions 
of  q(x)  and  ad  ~]lSq  over  the  ‘squared  solutions’  of  L.  In  Section  5  we  briefly 
discuss  the  fundamental  properties  of  these  MNLS  type  equations. 

2.  MNLS  equations  for  BD.I.  series  of  symmetric  spaces 

MNLS  equations  for  the  BD.I  series  of  symmetric  spaces  have  the  Lax 
representation  [L,  M]  =  0  as  follows 

Lip(x,  t.  A)  =  idx +  (q(x,  t)  -  \  J)ip(x,  t,  A)  =  0.  (4) 

AUp(x,t,\)  =  idftp  4-  (Vo(x,  t)  +  AVi(x,£)  —  \2J)rp(x,t,  A)  =  0,  (5) 

Vi (x,t)  =  q{x,t),  Vo(x,t)  =  zad  J 1  ^  f  [ad  j'q,  q(x,  <)]  .  (6) 

We  consider  n  =  2r  4- 1  and  choose  q(x,  t)  =  Ylae a+  (</aEa  +  pQE_a)  where 
the  set  of  roots  A*  =  {ei  —  e2, . . . ,  e\  —er,  er,  e\  +  er, . . . ,  e\  +  e2}.  Eor  the 
typical  representation  we  have  the  matrix  form: 

(0  f  0  \ 

q(x,t)=lp  0  s0<f  ,  J  =  diag(l,0, . .  .0, -1).  (7) 

\  0  0  / 

The  n-component  vectors  q  =  (<72,  •  •  • ,  qn)T  are  formed  by  the  coefficients 
qa  as  follows:  qk  =  qei-ek,  <7r+i  =  <7ei  and  <jrn+i-fc  =  qex+ek,  k  = 
the  vector  p  =  (p2,  •  •  •  ,Pn)T  is  formed  analogously.  The  matrix  So  =  SqV) 
enters  in  the  definition  of  so(n),  i.e.  X  G  so(n),  if  X  4-  Sq1^ XT Sq1)  =  0, 
and  for  n  —  2r  4- 1 : 

n+1 

s= 1 


(8) 
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With  this  definition  of  orthogonality  the  Cartan  subalgebra  generators  are 
represented  by  diagonal  matrices.  By  E^p  above  we  mean  n  x  n  matrix 
whose  matrix  elements  are  (Eip^)ij  =  SslSPj.  In  terms  of  these  notations 
the  generic  MNLS  type  equations  connected  to  BD  I.  acquire  the  form 

iqt  4  qxx  4  2 (<7,  p)q-  (<7,  s0q)s0p  =  0, 
ipt  ~  Pxx  -  2 (q,  p)p  4  (p,  s0p)s0q  =  0. 

This  equation  allows  the  typical  reduction  p  =  q* .  The  Hamiltonian  for 
these  MNLS  equations  (9)  is  given  by 

/oo 

dx  ((dxq,dxq*)  -  +  (q,  s0q){q\  s0<f*)) .  (10) 

-OO 

For  r  =  2  we  introduce  the  variables  <Li  =  <72,  $o  =  q$/\/2,  4_i  =  q 4;  for 
r  =  3  we  set  $2  =  <72 ,  $1  =  <73,  $0  =  <74,  $-1  =  <75  and  <f>_2  =  qe-  This 
reproduces  the  action  functionals  Eqp  for  F  =  1  and  F  =  2. 


3.  The  Inverse  scattering  problem. 

Solving  the  direct  and  the  inverse  scattering  problem  (ISP)  for  L  uses  the 
Jost  solutions  which  are  defined  by,  see16  and  the  references  therein 

lim  <j>{x,t,\)elXJx  =  1,  lira  t,  \)elXJx  =  1  (11) 

x  — ► — oc  x— *oo 


and  the  scattering  matrix  T(\A)  =  t,A).  The  choice  of  J  and  the 

fact  that  the  Jost  solutions  and  T( A,  £)  take  values  in  the  group  SO(n  4  2) 
means  that  we  can  use  the  following  block-matrix  structure  of  T( A,  t) 


( 

-b-T 

C1  \ 

771  j 

Cl  \ 

T(A,*)- 

b+ 

T22 

f  (A,  t)  = 

T22 

•s06- 

ct 

B  +  rs0 

m7  / 

—b+T  sq 

m+  / 

(12) 

where  ^(A,  £)  and  /^(A.i)  are  n-coinponent  vectors,  T22(A)  and  m±(A) 
are  n  x  n  block  matrices,  and  mf(\)y  cf  (A)  are  scalars.  Such  parametriza- 


tion  is  compatible  with  the  generalized  Gauss  decompositions  of  T( A). 

T(  A,  t)  =  TJ  DjSj  =  T+DjSJ,  (13) 


1  0  0] 

^  1  f+'T  Cj’  \ 

II 

e-T 

P+  no 
(c'4  Pf'rs0  1  j 

II 

+-> 

co 

0  n  sof+ 

(00  1  y 

SJW  =  (sj( A*))' ,  T+( A)  =  (f7(A’))f ,  DJ (A)  =  (D+(A*))f , 
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=  ^(p^sop*),  C*  =  ^(f^sof*) 
where  the  block-diagonal  matrix  D+  =  l>diag  (m^,  mj,  1  /m+). 


(15) 


b+ 


T  '  = 


+  ’ 


(16) 

M  mi 

The  matrix  elements  of  T( A)  satisfy  a  number  of  relations  which  ensure  that 
both  T( A)  and  its  inverse  T( A)  belong  to  SO(n  +  2)  and  that  T(X)T(X)  =  H. 
Some  of  them  take  the  form: 

mt m\  +  (b~ ,  B+)  +  ctc\  —  1*  b+  B~ 1  4-  T22S0T22S0  4-  sqB~  b+7  sq  =  1. 


Important  tools  for  reducing  the  ISP  to  a  Riemann- Hilbert  problem  (RHP) 
are  the  fundamental  analytic  solution  (FAS)  x±(x,  U  A).  Their  construction 
is  based  on  the  generalized  Gauss  decomposition  of  T(X,t),  see:17  '19 

X±{x ,  t,  X)  =  4>{x,  U  A )Sf  (t,  X)  =  ip{x,  t,  X )Tj(t,  X )D±  (A).  (17) 

If  q(x,t)  evolves  according  to  (9)  then  the  scattering  matrix  and  its 
elements  satisfy  the  following  linear  evolution  equations 

+A2pf(<,A)  =  °,  i^--\2fi-(t,\)  =  0,  =  0,  (18) 

so  the  block-diagonal  matrices  ^(A)  can  be  considered  as  generating  func¬ 
tionals  of  the  integrals  of  motion.  The  fact  that  all  (2 r—  l)2  matrix  elements 
of  m2  (A)  for  A  €  C±  generate  integrals  of  motion  reflect  the  superintegra- 
bility  of  the  model  and  are  due  to  the  degeneracy  of  the  dispersion  law  of 
(9).  We  remind  that  (A)  allow  analytic  extension  for  A  €  C±  and  that 
their  zeroes  and  poles  determine  the  discrete  eigenvalues  of  L. 

The  FAS  for  real  A  are  linearly  related 

X+(x,  t,  X)  =  *-(*,  U  X)Goj(X,  t)y  G0ij(A,  t)  =  SJ( A,  t)S+( A,  t).  (19) 

and  satisfy  the  normalization  relation  x±{x,ty  X)elXJx  =  H.  Then 

these  FAS  satisfy 

£+(x,  t ,  A)  =  U  X )Gj{x,  A,  0,  Gj(x,  A,  t)  =  e~iXJrGQj(X,  t)eiXJx. 

(20) 

Obviously  the  sewing  function  Gj(x,X,t)  is  uniquely  determined  by  the 
Gauss  factors  S±(X,t). 


Lemma  3.1.  Let  the  potential  q(x,t)  is  such  that  the  Lax  operator  L  has 
no  discrete  eigenvalues.  Then  as  minimal  set  of  scattering  data  which  deter¬ 
mines  uniquely  the  scattering  matrix  T(A,  t)  and  the  corresponding  potential 
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q(x,t)  one  can  consider  either  one  of  the  sets  T 2=1,2 

={p+(A,t),  a€A+},  T2  =  {r+(A,0,  a  e  A+},  A  €  R. 

Given  the  solutions  x±{x,t.  A)  one  recovers  q(x,t)  via  the  formula 

q(x,t)  =  lim  A  (J  —  x±J1x±{x^  f,  A))  .  (21) 

The  main  goal  of  the  dressing  method1'  20-22  is,  starting  from  a  known 
solutions  Xo  (x'  ^  °f  ^o(A)  with  potential  q^(x,t)  to  construct  new  sin¬ 
gular  solutions  i,  A)  of  L  with  a  potential  q(\)(x,  t)  with  two  (or  more) 
additional  singularities  located  at  prescribed  positions  A^;  the  reduction 
p  =  q*  ensures  that  Aj~  =  (A]1")*.  It  is  related  to  the  regular  one  by  a 
dressing  factor  u(x,t.  A),  for  details  see.13 


4.  The  Generalized  Fourier  Transforms  for  Non-regular  J 

It  is  known  that  the  ‘squared  solutions’  e^(x,  A)  =  7Tq  j  (x±  Eax±{x-,t,  A)), 
where  7r 0j-  =  adJ^adj-,  form  complete  set  of  functions  in  the  space  of 
allowed  potentials  q(x),  see.16  18  19  Skipping  the  details  we  write  down  the 
expansions  of  q(x)  and  ad  JlSq(x)  assuming  L  has  no  discrete  spectrum: 


roo 

q(x)  =  -~  d\  Y  (r«  M^Or.A)  -  r-(A)eIa(x,A)) , 

**  J —  oo 


(22) 


aeAf 


r  DC 

adj16q(x)  =  -  d\  Y  (St*  (A)eJ (*- A)  +  St~ (\)eZa(x,  A)) .  (23) 

"  J  —  OO 


Qr€  Aj 


These  expansions  can  be  viewed  as  tool  to  establish  the  one-to-one  cor¬ 
respondence  between  q(x)  (resp.  ad  ~]lSq  and  each  of  the  minimal  sets  of 
scattering  data  (resp.  (ST*),  i  =  1, 2.  To  complete  the  analogy  between  the 
standard  Fourier  transform  and  the  expansions  over  the  ‘squared  solutions’ 
we  need  the  generating  operators  A±: 


A±X(:r)  =  adj  +7 


dX 


q(x),  [  dy[q{y),X(y)\ 

J  ioo 


for  which  the  ‘squared  solutions’  are  eigenfunctions: 


(24) 


(A+  -  A)etft  (x,  A)  =  0,  ( A+  -  A)e„  (x,  A)  =  0. 

(A_  —  A)eJ  (x,  A)  =  0.  (A_  —  A)ela(x,  A)  =  0. 


(25) 
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5.  Fundamental  properties  of  the  MNLS  equations 

The  expansions  (22),  (23)  and  the  explicit  form  of  A±  and  eq.  (25)  are 
basic  for  deriving  the  fundamental  properties  of  all  MNLS  type  equations 
related  to  the  Lax  operator  L.  Each  of  these  NLEE  is  determined  by  its 
dispersion  law  which  we  choose  to  be  of  the  form  F( A)  =  /(A)J,  where 
/(A)  is  polynomial  in  A.  The  corresponding  NLEE  becomes: 

iadj'qt  +  f(A±)q(x,t)  =  0.  (2G) 


Theorem  5.1.  The  NLEE  (26)  are  equivalent  to:  i)  the  equations  (18) 
and  ii)  the  following  evolution  equations  for  the  generalized  Gauss  factors 
ofT(X): 

+  =  +  771-0.  ^  =  0.  (27) 

The  principal  series  of  integrals  is  generated  by  the  asymptotic  expan¬ 
sion  of  lnm^ (A)  =  Yl'kLi  ^A-*.  The  first  three  integrals  of  motion: 

i  f°°  ■  1  f°° 

h  =  ~„  dx(q(x),q(x)),  h  =  -z  /  dx  (qx(x),&dj1q(x)),  (28) 

“  J  — oo  ^  J  —  oo 

Now  ill  can  be  interpreted  as  the  density  of  the  particles,  I2  is  the  mo¬ 
mentum  and  / 3  =  e//\iNLS  Indeed,  the  Hamiltonian  equations  of  motion 
provided  by  //( 0)  =  —ih  with  the  Poissson  brackets 

{Qk(y,t),Pj(x,t)}  =  i5kjS(x  -y).  (29) 


coincide  with  the  MNLS  equations  (9).  The  above  Poisson  brackets  are  dual 
to  the  canonical  symplectic  form: 

ilo  =  i  J  dxtr  ^8p(x)  ASq(x)^  =  ^  [[ad  jl6q(x)  Aad  J1^^)]], 

where  A  means  that  taking  the  scalar  or  matrix  product  we  exchange  the 
usual  product  of  the  matrix  elements  by  wedge-product. 

The  Hamiltonian  formulation  of  cq.  (9)  with  Qo  and  Ho  is  just  one 
member  of  the  hierarchy  of  Hamiltonian  formulations  provided  by: 

ilk  =  |  [[ad  Afcad  J *«?]],  Hk  =  ik+3Ik+3.  (30) 


where  A  =  A(A+  +  A_).  We  can  also  calculate  llk  in  terms  of  the  scattering 
data  variations.  Imposing  the  reduction  q(x)  =  q^(x)  we  get: 


1  r 
nk  =  —  / 

7-o 

_L  [°° 

~  27T  7-oc 


d\\k  (12J(A)  (A)) 

) 

dX  AfcIm  (m+(A)  (m2+<5p+(A)  A^A)))  . 
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This  allows  one  to  prove  that  if  we  are  able  to  cast  Hq  in  canonical  form,  then 
all  will  also  be  cast  in  canonical  form  and  will  be  pair-wise  equivalent. 
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Abstract 

We  consider  several  ways  of  how  one  could  classify  the  various  types  of  soliton 
solutions  related  to  multicomponent  nonlinear  evolution  equations  which  are 
solvable  by  the  inverse  scattering  method  for  the  generalized  Zakharov-Shabat 
system  related  to  a  simple  Lie  algebra  g.  In  doing  so  we  make  use  of 
the  fundamental  analytic  solutions,  the  Zakharov-Shabat  dressing  procedure, 
the  reduction  technique  and  other  tools  characteristic  for  that  method  The 
multicomponent  solitons  are  characterized  by  several  important  factors:  the 
subalgebras  of  g  and  the  way  these  subalgebras  are  embedded  in  g,  the 
dimension  of  the  corresponding  eigensubspaces  of  the  Lax  operator  L,  as  well 
as  by  additional  constraints  imposed  by  reductions. 

PACS  numbers:  05.45.  Yv,  02.30.Zz,  02.20.Sv,  02,30.1k 


1.  Introduction 

Since  the  time  when  the  inverse  scattering  transform  was  invented  the  number  of  integrable 
nonlinear  evolution  equations  (NLEE)  has  been  increasing  significantly  [1-3].  There  exist 
different  approaches  in  analyzing  these  equations:  constructing  the  spectral  theory  of  the  so- 
called  recursion  operators  from  Lax  scattenng  operators,  studying  whether  there  exist  higher 
order  integrals  of  motion  and  so  on. 

Along  with  the  number  of  integrable  equations  the  diversity  of  ‘species'  of  soliton  solutions 
has  bloomed  up:  breathers,  topological  solitons,  trapons,  boomerons,  etc  Thus  the  analogous 
problem  of  classifying  solutions  to  nonlinear  evolution  equations  and  soliton  solutions,  in 
particular,  becomes  more  and  more  important.  It  is  our  opinion  that  the  mentioned  problem 
is  still  waiting  for  its  solution.  Even  for  some  of  the  best  known  NLEE  such  as  the  N- wave 
equations,  the  multicomponent  nonlinear  Schrodinger  equations,  multicomponent  modified 
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KdV  equations,  etc  only  the  simplest  types  of  soliton  solutions  are  known.  Our  aim  is  to 
outline  several  criteria  for  the  classification  of  the  soliton  solutions. 

We  use  the  term  ‘soliton  solution'  as  a  special  solution  to  a  given  NLEE  which  is  solvable 
by  the  inverse  scattering  method  [1-3],  i.e  it  allows  a  zero  curvature  representation 

IL(X),  M (*)]  =  0,  (1) 

where  L(X)  and  M (X)  are  two  linear  differential  operators  In  what  follows  we  take  them  to 
be  first-order  matrix  differential  operators, 

L\l/(x%  /,  X)  5  \dxip  +  U ( x ,  /,  X)y/f{: c,  /,  A.), 

M\I/(x,t,X)  =  \dtyl/  +  V(x ,  /,  X)\//(x,  /,  X). 

The  soliton  solutions  are  related  to  a  set  of  several  discrete  eigenvalues  of  the  Lax  operator 
L.  Therefore  one  first  has  to  describe  all  configurations  of  discrete  eigenvalues  of  L,  see  [4]. 
The  next  step  in  classifying  the  types  of  one-soliton  solutions  is  related  to  the  study  of  their 
internal  degrees  of  freedom 

In  order  to  make  the  problem  not  too  difficult  we  will  specify  L  to  be  the  operator  for  the 
generalized  Zakharov-Shabat  system 

L(X)\j/(x,  X)  s  i dx\{/  +  (q(x)  —  XJ)\//(x,  X)  =  0, 

where  we  take  the  potential  an  q(x ,  /)  to  be  an  n  x  n  matrix-valued  smooth  function  of  x 
tending  to  zero  sufficiently  rapid  as  x  — >  ±oc.  We  also  restrict  J  to  be  a  real  constant  diagonal 
matrix  with  different  eigenvalues. 

For  simplicity  we  consider  the  class  of  Lax  operators  of  Zakharov-Shabat  type  in  most 
cases  with  real-valued  J .  In  doing  this  we  will  be  using  the  dressing  method  [5-7],  one  of 
the  best  known  methods  for  constructing  refiectionless  potentials  and  soliton  solutions.  This 
paper  is  intended  as  a  natural  continuation  of  the  work  [8]  published  several  years  ago  by  two 
of  the  authors. 

In  section  2,  we  outline  preliminary  facts  about  the  generalized  Zakharov-Shabat  systems 
related  to  the  sl(n)  algebras.  We  also  review  the  basic  facts  for  the  sl( 2)  soliton  solutions,  and 
especially  the  topological  and  the  breather  solutions. 

In  section  3,  we  treat  the  different  types  of  one-soliton  solutions  for  the  sl(n)  Zakharov- 
Shabat  systems  starting  with  n  =  3.  Along  with  the  well-known  soliton  solutions  obtained  by 
Zakharov  and  Manakov  [9]  and  Kaup  [10]  we  show  that  the  3-wave  equations  with  aZ2  x  Z2 
symmetry  allow  doublet  and  quadruplet  soliton  solutions  (analogies  to  the  topological  and 
breather  solutions  mentioned  above).  Next  we  analyze  jV-wave  equations  related  to  sl(n ) 
algebra  with  n  =  5;  of  course  nearly  all  formulae  can  easily  be  generalized  for  any  n.  Here  we 
use  generic  projectors  of  rank  s  ^  1  and  explain  why  it  is  enough  to  consider  only  s  ^  [n/2] 
where  [n/2]  is  the  entire  part  of  n/2.  We  also  demonstrate  that  using  special  choices  for  the 
polarization  vectors  defining  the  projector  we  can  get  one-soliton  solutions  <?is(.v,  /)  taking 
values  in  a  subalgebra  of  sl(n)  The  simplest  nontrivial  type  of  solitons  can  be  related  to  the 
subalgebra  5/(2);  we  call  them  typical  5/(2)  solitons.  Our  next  observation  is  that  the  5/(2) 
subalgebra  can  be  embedded  in  several  different  inequivalent  ways  into  5/(5).  For  example 
we  can  embed  a  spin  J  representation  of  5/(2),  J  ^  2  into  5/(5);  we  call  them  spin  J  5/(2) 
solitons.  The  corresponding  construction  is  done  by  using  the  corresponding  symmetrized 
tensor  products  of  the  5/(2)  dressing  factor.  The  next  step  is  to  consider  typical  5/(3)  and  5/(4) 
solitons.  Of  course,  like  for  the  5/(3)  algebra,  using  additional  ^-symmetries  one  can  obtain 
/V-wave-type  equations  allowing  doublet  and  quadruplet  solitons. 

In  section  4,  we  analyze  the  structure  of  the  eigenfunctions  of  L(X)  corresponding  to  the 
different  types  of  solitons. 
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In  section  5,  we  discuss  the  effects  of  several  types  of  reductions  [11]  on  the  different 
types  of  one-sol iton  solutions.  The  first  Z2-reduction  does  not  affect  the  spectral  parameter  X 
and  is  produced  using  the  exterior  automorphism  of  5/(5).  Its  effect  is  that  now  q(x,  t)  —  XJ 
takes  values  in  the  subalgebra  50(5)  C  5/(5).  Here  we  briefly  recall  the  soliton  solutions  of 
the  50(5)  4-wave  equations  we  derived  in  [12].  Next  we  apply  an  additional  pair  of  Z2  x  Z2 
reductions  we  obtain  a  4-wave  system  allowing  doublet  and  quadruplet  solutions.  At  the  end 
of  this  section  we  analyze  the  NLEE  related  to  the  symmetric  spaces  [13-15]  of  BD  I  type;  the 
corresponding  J  in  the  Lax  pair  is  dual  to  the  vector  e\  in  the  root  space.  One  easily  checks  that 
for  such  J  the  V-wave  equations  become  linear.  However  there  are  multicomponent  mKdV- 
type  equations  which  may  be  called  vector  mKdV  equations.  Such  equations  have  been  shown 
to  possess  higher  symmetries  and  have  been  related  to  Jordan  algebras  [lb].  The  simplest 
nontrivial  example  is  a  three-component  mKdV  related  to  the  algebra  50(5).  We  apply  an 
additional  Z2-reduction  on  it  using  an  automorphism  related  to  a  specific  Weyl  reflection  and 
obtain  a  two-component  vector  mKdV  which  to  the  best  of  our  knowledge  is  new.  We  end  the 
section  by  deriving  for  it  the  doublet  and  quadruplet  solutions. 

In  section  6  we  summarize  our  results  and  discuss  their  possible  generalizations.  In  the 
appendices,  we  collect  some  technical  details  about  the  symmetric  spaces  of  BD.I  type  and 
about  the  spin  J  representations  of  5/(2). 

2.  Preliminaries 

2.  /.  The  generalized  Zakharov-Shabat  system  related  to  sl{n) 

In  this  section,  we  shall  outline  some  basic  features  of  the  mathematical  machinery  we  are 
about  to  use  for  the  classification  of  soliton  solutions. 

lntegrability  or  more  precisely  S-integrability  of  a  NLEE  means  that  the  NLEE  can  be 
presented  as  a  zero  curvature  condition 

[L(X),M(X)]  =  0  (2) 

of  two  first-order  linear  matrix  differential  operators  L{X)  and  M{X)  of  the  form 

L\l/(x ,  /,  X)  =  (idjt  +  U(x ,  /,  X))\l/( jc,  /,  A.)  =  0,  (3) 

M t/t (jc,  /,  X)  s  (id;  +  V (jc,  /,  X))\//(x ,  /,  X)  =  t/f(JC,  /,  X)C(X).  (4) 

The  potentials  U  (jc,  /,  X)  and  V  (jc,  /,  X)  are  typically  chosen  as  elements  of  some  semisimple 
Lie  algebra  g  (the  fundamental  solutions  ^(jc,  /,  X)  belong  to  the  corresponding  Lie  group  G). 
We  shall  mainly  deal  with  the  algebra  sl(n). 

Remark  1.  The  compatibility  condition  (2)  means  that  the  Lax  operators  L  and  M  possess  the 
same  eigenfunctions.  The  matrix  C(A.)  depends  on  the  definition  of  Jost  solutions. 

Since  the  compatibility  condition  (2)  must  hold  true  identically  with  respect  to  X  one  can 
verify  that 

iSx  V  dtU  +  [//(jc,  /,  X),  V(jc,  /,  X)]  =  0  (5) 

and  it  is  valid  for  any  choice  of  C(X).  For  simplicity  we  shall  restrict  our  considerations  on 
scattering  operators  of  the  Zakharov-Shabat  type  (GZS) 

L{X)\j/{ jc,  /,  X)  =  (id*  +<7(jc,  /)  —  XJ)\j/{ jc,  /,  X)  =  0.  (b) 

The  matrix  J  is  a  real  traceless  diagonal  matrix,  i.e.  a  real  Cartan  element  of  s/(n),  while 
^7 (jc,  /)  is  a  matrix  with  zero  diagonal  elements.  Since  J  is  a  real  matrix  one  can  introduce  an 
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ordering  of  its  elements  J\  >  Ji  >  ■  ■  •  >  7„.  By  carrying  out  a  gauge  transformation  which 
commutes  with  7,  we  can  always  take  q(x,  t)  to  be  of  the  form  q(x ,  /)  =  [7,  Q(: t,  /)],  i.e. 
qjj  =  0.  The  linear  subspace  in  sl(n)  of  matrix-valued  functions  q(xj)  =  [7,  Q(x ,  /)]  is 
known  in  the  literature  to  be  the  co-adjoint  orbit  of  g  passing  through  7.  The  co-adjoint  orbits 
can  be  supplied  in  a  natural  way  with  a  non-degenerate  symplectic  structure  which  makes 
them  natural  choices  for  the  phase  spaces  Mj  of  the  corresponding  NLEE. 

The  class  of  NLEE  related  to  L(k)  are  systems  of  equations  for  the  multicomponent 
functions  Q(x ,  /),  which  may  be  written  in  the  compact  form  [9,  10,  17,  18] 

4 

i3,e+2^A*[ff*.G(Jt./)]  =  0.  (7) 


where  //*,  tr  Hk  =0  are  constant  diagonal  matrices  and  fiX)  =  ]l*=i  Hk  is  the  dispersion 
law  of  the  NLEE,  Here  and  below  we  define 


(ad  jX)ks  =  ([ J ,  X])*s  =  (Jk  -  J,)Xks .  (adj'X)tj  = 


Xfa 

Jk  ~  J.t 


(8) 


for  all  X  e  Mj<  i.e.  X**  =  0.  The  operator  A  is  either  one  of  the  recursion  operators  A± , 
acting  on  the  space  M  j  of  n  x  n  off-diagonal  matrix-valued  functions  as  follows: 

A±X  =  ad^'  ^idxX+ Ptt[q.X]  +  \JjQ.Ek.k]  d.v  tr([£>,  X],  Ek,k)j  .  (9) 


where  Pq  is  the  projector  adj1  ad./ (£*  *)//  =  dk  jdjj-  Choosing  H\  =  /  =  diag( /j ,  ^ 

1 ,  so  that  the  dispersion  law  f{X)  =  —XI  is  a  linear  function  of  X  we  get  a  system,  generalizing 
the  well-known  N-wave  equation 


i IJ.  Qt]-i[l,Qx]-UJ<  QllPQ  11  =  0, 


(10) 


which  contains  N  =  n(n  —  \)  complex-valued  functions  Qtj{x<  /). 

In  order  to  describe  the  soliton  solutions  we  shall  use  the  so-called  dressing  procedure 
[7].  For  that  purpose  we  need  some  basic  facts  on  the  direct  scattering  problem  for  L. 

Let  y/f±(x,  /,  X)  are  two  fundamental  solutions  of  the  GZS  system  (6).  If  they  satisfy  the 
requirement 

lim  ^±(x,t,k)e'Ux  =  I  (11) 

X— ♦  rfcoo 

they  shall  be  called  Jost  solutions.  The  Jost  solutions  are  interrelated  via 


c,  /,  X)  —  \//+(x,  /,  X)T(t,  X ), 


(12) 


where  T(/,  X)  is  called  a  scattering  matrix.  The  scattering  matrix  is  Jt-independent  and  its  time 
evolution  is  driven  by  the  linear  equation 

i3,7'  +  [/(A.).7']  =  0  =>  T(t,k)  =  e'fa)lT(0,  X)e~'fWl.  (13) 

Equation  (13)  shows  how  one  can  recover  the  time  evolution  of  the  scattering  data.  It  is  used 
to  solve  Cauchy’s  problem  for  NLEE  as  it  is  displayed  below, 

q(t  =  0)  ^  L{t  =  0)  ^  Tit  =  0)  -  Tit)  -  Lit)  q(t).  (14) 

Due  to  this  we  shall  skip  the  /-dependence  of  all  functions  (potentials,  fundamental  solutions, 
etc)  regarding  them  at  a  fixed  moment  /  =  to. 

The  set  of  matrix  elements  of  T iX)  must  satisfy  a  number  of  relations.  Indeed,  they  are 
uniquely  determined  by  Qix)y  i.e.  by  /?(/?  —  1)  complex  functions  of  x ,  so  it  seems  natural 
that  there  should  not  be  more  than  n(n  —  1)  independent  functions  among  TjkiX)  for  X  on 
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the  real  axis.  Of  course,  T  (X)  must  satisfy  the  ‘unitarity’  condition  det  T(X)  —  1  The  rest 
of  these  relations  follow  from  the  analyticity  properties  of  certain  combinations  of  the  matrix 
elements  of  T  (X).  These  analyticity  properties  must  follow  naturally  from  the  corresponding 
fundamental  analytic  solutions  (FAS)  x±(x>  A.). 

The  Jost  solutions  are  well  defined  only  for  X  e  R,  i.e.  they  do  not  have  necessarily 
analytic  properties  beyond  the  real  axis.  This  can  be  easily  seen  if  one  reformulates  the 
problem  (6)  in  terms  of  a  Volterra-type  integral  equation 

$±(x,k)  =  l+i  [  d ye'kMy~x)q(y)t;±(y,VeikH'-y\  (15) 

J±  00 

where  £±(jc,  X)  =  i/±(x,  X)  elkJx  represents  another  set  of  fundamental  solutions  but  this  time 
to  the  linear  problem 

idx$(x,X)+q(x)(;(x9X )  -A[y,f(x,A)l  =  0. 

It  is  easy  to  see  that  only  the  first  and  the  last  columns  of  if/+(x,X)  and  rfr-(x,  X)  allow 
analytic  extensions  in  X  off  the  real  axis;  generally  the  other  columns  do  not  have  analyticity 
properties.  Nevertheless  it  is  possible  to  introduce  FAS  [3,  19]  Taking  into  account  the 
ordering  introduced  above  one  is  able  to  construct  new  fundamental  solutions 


r.A)  =  ' 

Ski  +  i 

fx 

[  d  ye'X(J'-J')iy-x)(qS±)u(y,k)' 
J±oo 

k  ^  /, 

(16) 

-  J  TOO 

dyei 

k  >  l 

to  possess  analytic  properties  in  the  half  planes  C±  of  the  spectral  parameter.  This  definition 
can  be  rewritten  using  the  Gauss  factors  of  the  scattering  matrix  T 

X±(x,X)  =  WS±M  =  1r+(x.X)T*{X),  (17) 

where  T (X)  =  T:F(A)(§±(A.))" 1  and  =  ^±(x,X)e~lkJx.  The  matrix  elements  of 

T^A)  and  S±(A)  can  be  expressed  in  terms  of  the  minors  of  T(X).  Here  we  note  that  their 
diagonal  elements  can  be  given  by 

=  mJ_,(X).  TjjW  =  m}(A),  (18) 


T ]j(X)  =  SJ/A.)  =  (19) 

where  /nj  =  /n*  =  1  and  by  m^(k)  (resp.  nt^(k))  we  have  denoted  the  upper  (resp.  lower) 
principal  minors  of  T  (X)  of  order  k ,  e.g.. 


mt(X) 


1  2  ...  k 1 

1  2  ...  k\' 


k=  1 . n 


mk  {X)  = 


In  —  k  +  1 
\n  -k  +  1 


n-k+2 

n-k+2 


(Tij, 

•  •  Ti ,  jk  \ 

j/l  l2 

•  •  «*  i 

=  det 

Thh 

Thh 

Ti2jk 

\ji  h  • 

■  •  jk  i 

1  T{k) 

Tikji  * 

TiJ 

(20) 

(21) 


(22) 


As  a  consequence  of  the  analyticity  of  the  FAS,  it  follows  that  the  minors  m£(A)  (resp. 
mk(X))  are  analytic  functions  for  X  e  C+  (resp.  for  X  e  C_). 
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One  can  construct  the  kernel  of  the  resolvent  of  L(X)  in  terms  of  the  FAS  [17,  18]  from 
which  it  follows  that  the  resolvent  has  poles  for  all  xf  which  happen  to  be  zeroes  of  any  of 
the  minors  mf(X).  Therefore  what  we  have  now  is  that  each  of  the  minors  mf(X)  may  be 
considered  to  be  an  analog  of  the  Evans  function,  and  thus  now,  there  is  more  than  one  Evans 
function,  with  each  Evans  function  in  a  one-to-one  correspondence  with  one  of  the  minors, 
mf(X). 

There  exist  different  methods  to  solve  a  NLEE  possessing  a  Lax  representation:  Gel’fand- 
Levitan-Marchenko  integral  equation,  Hirota  method,  dressing  method,  etc.  We  shall  use  the 
dressing  Zakharov-Shabat  method  [7].  Let  A)  be  a  fundamental  solution  of  Zakharov- 
Shabat's  system  with  a  known  potential  Uq(x ,  X)  =  <7o(*)  —  X7 .  Consider  a  new  function 
\ls(x,X)  =  u{xyX)\!/ o(jc,  X)  which  is  a  solution  to  Zakharov-Shabat ’s  problem  with  some 
potential  q(x)  —  XJ  to  be  found.  This  requires  that  m(j t,  X)  satisfies 

\dxu  +  qu  —  uqo  —  X[J,u]  =  0.  (23) 

The  dressing  procedure  transforms  the  Jost  solutions  X),  the  scattering  matrix  T0(X) 

and  the  fundamental  solution  Xq(x *  °f  the  generalized  Zakharov-Shabat  system  with  a 
potential  Uq(x,  X)  in  the  following  fashion: 


f±(x.  X)  =  u(x.  \)f±.o(x,  X)ul'(X). 

(24) 

T(X)  =  u+(X)Ta(\)uZl(X), 

(25) 

X±(x,  X)  =  u(x,  X)x(f(x,  X)mI'(X). 

(2b) 

The  normalizing  factors  w±(X)  =  \\mx^±OQu(x,  X)  ensures  the  proper  asymptotics  of  the 
dressed  solutions  \j/±(x ,  X). 

Our  aim  is  by  using  the  structure  of  the  dressing  factor  to  try  to  classify  the  soliton 
solutions  of  NLEE.  A  classical  ansatz  [3]  for  m(jc,  X)  is  the  following  one: 

u(x,X)  =  I  +  (c(X)  -  1  )P(x),  c(X)  =  (27) 

where  P  is  a  projector  ( P 2  =  P )  which  can  be  expressed  via  the  fundamental  analytic 
solutions  (FAS)  and  X+  (resp.  X~)  is  an  arbitrary  complex  number  in  the  upper  (resp.  lower) 
half  plane  C+  (resp.  C_).  After  applying  the  dressing  procedure  X±  become  a  pair  of  discrete 
eigenvalues  for  the  dressed  Lax  operator  The  projector  P(x)  determines  the  corresponding 
eigensubspaces — the  rank  s  of  P  equals  the  dimension  of  the  eigensubspaecs.  If  we  introduce 
two  sets,  each  containing  s  vectors 

I'M) . \nx):  (/'ill . (ms  |, 


we  can  write  P  as  follows: 

.T 

P  =  \na)Mab(mb\,  Mab  =  (ma\nh).  M  =  M~' . 

a.b=  I 

Obviously  the  two  sets  of  vectors  are  the  left  and  right  eigenvectors  of  P ,  namely 
P\na)  =  |na).  (mh\P  =  {mh\. 

for  all  a.  b  =  1 . s.  In  the  simplest  case  when  rankP  =  1  it  reads 

_  |nU))(mU)| 

U>  "  <m(x)|«(*))  ’ 

where 


(28) 


(29) 

(30) 
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|fl(*)>  =  Xo+U-3.+)l«o>. 


(w(x)|  =  {moKXo  (*■•  ))“’• 


(31) 
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By  taking  the  limit  k  oo  in  equation  (23)  we  obtain  an  interrelation  between  the  seed 
solution  <70  and  the  new  one 

q=qo  +  (k~  -k+)[Jy  P]  (32) 

Thus  starting  from  a  known  solution  of  the  NLEE  we  can  find  another  solution  by  simply 
dressing  it  with  some  factor  w  (a  ,  X).  An  important  particular  case  is  when  <70  =  0.  The  dressed 
solution  is  called  a  one-soliton  solution.  The  fundamental  analytic  solution  in  the  soliton  case 
is  given  by  a  plane  wave  Xo  (*»  =  exp(— iX7;c).  Repeating  the  same  procedure  one  derives 

step  by  step  the  multisoliton  solution  of  the  corresponding  NLEE,  i.e., 

0  ->  q" -*  q2* -* - *  <?"*.  (33) 

Many  integrable  equations  correspond  to  Lax  operators  that  obey  some  additional 
symmetry  conditions  of  algebraic  nature.  That  is  why  it  is  worthwhile  to  outline  some 
aspects  of  the  theory  of  such  Lax  operators. 

Let  an  action  of  a  discrete  group  Gr  to  be  referred  to  as  a  reduction  group  be  given  on 
the  set  of  fundamental  solutions  to  the  generalized  Zakharov-Shabat  system  (6)  as  follows: 


ir(x,k)  =  K^(x,k(k))K~\  (34) 

where  k  :  C  — ►  C  is  a  conformal  map.  This  action  yields  another  action  on  the  potential  in 
the  scattering  operator  L, 

KU(x,k(k))K-'  =  U(x,X).  (35) 

A  common  case  is  when  Gr  =  Z2.  Then  the  action  of  Z2  might  involve  external 
automorphisms  of  SL(n )  as  well, 

#(jt,A.)  =  K(fT(x.kiW)y  'K~'  =>  KUT{x.k\{X))K~x  -  -U(x,X),  (36) 

\fr(x,X .)  =  K1r*(x,k2(k))K-'  =>  KU*(x,  k2(k))K~l  =  -U(x.  X).  (37) 


In  particular,  if  Z2  acts  trivially  on  the  complex  plane  of  the  spectral  parameter,  i.e.  k  =  id , 
then  the  symmetry  condition  (36)  may  restricts  the  potential  U (a,  k)  to  a  certain  subalgebra  of 
sl(n).  For  example,  suppose  KT  =  5b A' 5o  then  U (a,  k)  belongs  to  the  orthogonal  algebra3 
so(n).  The  existence  of  a  Z2  reduction  requires  a  modification  of  the  dressing  factor  m(a,  X) 
as  follows: 


m(a,  k)  =  1  +  (c(A)  —  \)P(x)  + 


(55  -  0 


where  P( a)  is  a  projector  of  rank  1, 
|n(j:))(m(j:)| 

P(x)  - 


P(x)  =  KS{)Pt(x)S()'K- 


(m(x)\n(x)) ' 

The  projector  itself  can  be  expressed  through  the  FAS  x 1  f.v .  X), 


(38) 


(39) 


l«U)>  =  XqU.  ^+)|no).  <m(x)|  =  <m0|(x0  (*■  x  ))  '•  (40) 


3  Here  we  use  a  slightly  modified  definition  of  the  orthogonal  algebras,  see  appendix  A  where  the  definition  and  the 
explicit  form  of  the  matrix  Sq  are  given. 
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2.2.  The  Zakharov-Shabat  system  and  5/(2)  solitons 

The  best  known  examples  of  NLEE  are  related  to  the  Zakharov-Shabat  system  which  is 
associated  with  the  5/(2)  algebra  as  follows: 


Li/f(jt,  /,  k)  =  (i3x  +  q{ x.  t)  —  jc,  /,  X), 


(41) 


where  a±  =  (<7|  ±  icr2)/2  and  o\.  oi  and  o 3  are  the  Pauli  matrices. 


The  class  of  NLEE  for  the  functions  q±{ jc,  t)  related  to  (41)  can  be  written  in  the  compact 
form  [20-22], 


\o^dtq  +  2f(A)q(x,t)  =  0, 


(42) 


where  / (A)  is  the  dispersion  law  of  the  NLEE  and  A  is  one  of  the  recursion  operators,  acting 
on  the  space  Mo  of  off-diagonal  matrix-valued  functions  as  follows: 


(43) 


The  simplest  nontrivial  example  of  NLEE  is  related  to  a  dispersion  law  of  the  type 
f(k)  —  —2k2.  This  is  the  nonlinear  Schrodinger  equation 


\q*  +  q*x  +  2(</+(jc,  l))2q~(x,  0  =  0. 
iqr  ~  Q7x  ~  2(<7_Cx.  t))2q*{x,  t )  =  0. 


(44) 


Another  well-known  example  is  provided  by  a  cubic  dispersion  law  / (k)  =  4 A.3,  one  gets  the 
system 


</,+  +  Qxxx  +  fy+(0q  ( x ,  t)q+  =  0, 

q;  +qxx  +  bq~{x.t)q+{xyt)q~  =  0, 


(45) 


directly  linking  to  the  Korteweg  de  Vries  equation. 

As  we  discussed  in  the  previous  section  the  scattering  theory  is  based  on  introducing  Jost 
solutions  of  L{k)%  scattering  matrix,  fundamental  solutions,  etc.  In  the  5/(2)  case  the  Jost 
solutions  are  2  x  2  matrix-valued  solutions  dehned  by  an  analog  of  ( 1 1 )  where  the  matrix  J 
is  simply  substituted  by  o 3.  Then  one  introduces  the  scattering  matrix  T(k ,  /)  by 


(46) 


which  is  jc-indepcndent.  The  /-dependence  of  the  scattering  matrix  is  driven  by 
\dtT  +  lf(k)oy,T(k,t)]  =  0. 

Thus,  if  q±{ jc,  /)  satisfy  the  system  of  equations  (42)  we  get 

3,0* (X)  =  0,  id lb±(k)  Ifik^ik)  =  0. 


(47) 


(48) 


The  matrix  elements  of  T(k.  t )  are  not  independent.  They  satisfy  the  ‘umtarity’  condition 
detT(X)  =  0+0~  +  b+b~  =  1  Besides  the  diagonal  elements  a+  and  allow  analytic 
extension  with  respect  to  k  in  the  upper  and  lower  complex  A-plane  respectively.  In  fact  the 
minimal  set  of  scattering  data  which  uniquely  determines  both  the  scattering  matrix  and  the 
corresponding  potential  q{x)  consists  of  two  types  of  variables:  (i)  the  reflection  coefficients 
p±(k)  —  b±/a±  defined  for  real  k  e  R  and  (ii)  a  discrete  set  of  scattering  data  including 


the  discrete  eigenvalues  kf  e  C±  and  the  constants  C*  which  determine  the  norm  of  the 
corresponding  Jost  solutions  [23]. 
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A  simple  analysis  shows  that  the  first  column  of  i/+  allows  analytic  continuation  in  the 
lower  half  plane  of  the  spectral  parameter  while  the  last  one  in  the  upper  half  plane  (for  \j/- 
the  opposite  holds  true) 

f+(x,t,k)  =  \f~  ,VC|,  (49) 

The  superscripts  ±  in  the  columns  of  the  Jost  solutions  refer  to  their  analyticity  properties  while 
the  subscripts  ±  refer  to  different  Jost  solutions  (with  different  limits  of  jc).  The  fundamental 
analytic  solutions  are  constructed  in  the  following  manner: 

x+(x,t,k)=  =  (50) 

The  functions  a*  (A,)  =  detx±(;c,A.)  are  known  as  the  Evans  functions  [6,  24]  of  the 
system  L(A).  Their  importance  comes  from  the  fact  that  they  are  /-independent  (see  (48)),  and 
therefore  they  (or  rather  In  a*)  can  be  viewed  as  generating  functionals  of  the  (local)  integrals 
of  motion.  In  addition  it  is  known  that  their  zeroes  determine  the  discrete  eigenvalues  of  L(A.), 

a+(x£)  =  0,  A£eC+;  <r(x')  =  0,  **<=€_.  (51) 

One  can  define  the  soliton  solutions  of  the  NLEE  as  those  for  which  p±(k)  =  0  for  all 
A  e  R.  Thus  the  soliton  solutions  of  (42)  are  parametrized  by  the  discrete  eigenvalues  and  the 
constants  C f  whose  /-dependence  is  determined  from 

lf=0,  ''if*  2/^=0.  J?  =/(*?)■  (52) 

In  fact  we  will  analyze  the  various  possible  types  of  one-soliton  solutions;  in  our  case 
they  are  determined  by  one  pair  of  discrete  eigenvalues  A.±  e  C±  and  one  pair  of  norming 
constants  C±.  Thus  for  (42)  we  get  just  one  type  of  one-soliton  solutions.  In  order  to  derive  its 
explicit  form  we  shall  use  the  dressing  Zakharov-Shabat  method  [7].  In  our  case  the  dressing 
factor  w(jc,  /,  A.)  is  given  by  a  2  x  2  matrix  of  the  form  (27)  where  P  is  a  projector  of  rank  1 
(see  (30)).  Then  the  following  relations  hold 

P\n(x,  t))  =  |«(jc,  f)),  |n(*.f)>  =  ("‘I’*’!!)  •  (53) 

\n2(x,  t)J 

(m(x,  t)\P(x,  t)  =  {m(x,  f)|.  (m(x.t)\  =  {mi(x,t),m2(x,t)).  (54) 

The  transmission  coefficients  are  transformed  by  the  dressing  procedure  as  follows: 

a+W  =  c(A)a0+(A).  a"(X)  =  (55) 

c(X) 

The  5/(2)  analog  of  (32)  reads 

<?(*,  /)  -  qo(x ,  /)  =  -(AT  -  A.")| <t3,  P(x,  /)].  (56) 

By  applying  the  above  formulae  to  properly  chosen  constant  vectors  | z?o)  and  (mol  we  can 
construct  the  eigenvectors  of  P(x ,  /)  and  as  a  result,  obtain  P(x ,  /)  explicitly.  It  then  remains 
only  to  insert  it  into  (56)  in  order  to  obtain  the  corresponding  potential  q(x ,  /)  explicitly.  It 
can  be  proved  that  the  spectrum  of  L(k)  will  differ  from  the  spectrum  of  T0(A.)  only  by  an 
additional  pair  of  discrete  eigenvalues  located  at  AT  €  C±. 

A  pure  soliton  solution  is  obtained  by  assuming  qo(x,  /)  =  0;  as  a  result  we  have 

|k(jc,  /))  =  e“,(j:;'++^+r)or3|/io).  /)|  =  <wol  el(jrX  t)a\ 

j  /  e<t>oU.o  e -  ■*♦(■* .o\ 

P(X' 0  =  2cosh<I>o(jr.O  \t2  emxJ)  ^ixJ)  )  (57) 

<J>o(x,  t)  =  — i(A+  -  k~)x  +  i(/+  -  f-)t  -  ln/c,, 

<J>(x,/)  =  (A+  +  A-)x-  (f+  +  f~)t. 
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where  /±  and  the  constants  Ky  and  k2  are  given  by 


/±  =  /(X*), 


*1  = 


*0,1  **0,1 


*0.2*10.2 

Then  the  corresponding  one-soliton  solution  takes  the  form 


*2  = 


A  0,1  Wf).  2 
*0,2**().l 


<7+(*,/)  = 


y2q*-X-)e-i,t,(x-<) 

cosh  4>o(x,  /) 


<7  Or,  /)  = 


k2(X+-  X-)e1(t,u<f) 
*2  cosh  <D0(x,  /) 


(58) 


(59) 


Remark  2.  The  eigenvalues  A1*1  are  two  independent  complex  numbers;  therefore,  in  the 
denominator  in  (57)  we  get  cosh  of  complex  argument.  This  function  vanishes  whenever  its 
argument  becomes  equal  to  i(jr/2  +  pit)  for  some  integer  p  and  the  generic  solitons  of  (42) 
may  have  singularities  for  finite  x  and  /. 

One  way  to  avoid  these  singularities  is  to  impose  on  the  Zakharov-Shabat  system  an 
involution,  i.e.  if  we  constrain  the  potential  <7o(*.  /)  by 

q(x.t)  =  qHx.t),  i.e.  q+  =  (q~Y  =  u(x.  t).  (60) 

Such  constraint  reduces  (42)  to  NLEE  for  the  single  function  w(jc,  /);  the  second  equation 
of  the  system  becomes  consequence  of  the  first  one.  As  a  result  (44)  becomes  the  NLS 
equation 

iu,  +  uxx  +  2|w|2k(x,  /)  =  0,  (61) 

while  (45)  goes  into  the  MKdV-type  equation 

Ui  +uXxx  +  6|w(*./)|2m,  =  0.  (62) 

This  involution  imposes  constraints  on  all  the  scattering  data;  in  particular  we  have 

a+(X)  =  b+  (X)  =  (b-(k*))\  (63) 

From  the  first  relation  above  we  find  that  the  zeroes  of  the  functions  a±(k)  which  are  the 
eigenvalues  of  Lq(X)  must  satisfy 

=  (AT)*  =  M  +  iv,  C+  =  (C~)*,  P(x,  /)  =  P](xJ).  (64) 

So  now  the  one-soliton  solution  corresponds  to  a  pair  of  eigenvalues  which  must  be  mutually 
conjugated  pairs. 

As  a  result  we  find  that  the  expression  for  P(x ,  /)  and  the  one  for  the  one-soliton  solution 
simplifies  to 

]  /gOooU.O  e~i<l>oiU,f)\ 

P(x,l)  ~  2 cosh  <t>(X)(jr,/)  ) 

ni 

<t>oo(*i  t)  =  2ujt  —  2 ht  —  In  , 
no 

$01  (x,  I)  =  2 nx  -  2 gt  -aignl  +  arg  nfy  (65) 

where 

A*  =  ±  iu,  f*  =  g  ±  ih.  (66) 

Now  both  functions  <I>oo(jc, /)  and  <Doi(jc,/)  become  real  valued.  The  denominator  now 
becomes  cosh  of  real  argument,  so  this  soliton  solution  is  regular  function  for  all  jc  and  /. 

One  can  impose  on  <7o(*<  0  a  different  involution 

q(x,t)  = -qHx.t).  i.e.  q+  =  -(q~Y  =  u(x.  t).  (67) 
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However  it  is  well  known  that  under  this  involution  the  Zakharov-Shabat  system  L(X)  becomes 
equivalent  to  an  eigenvalue  problem 

c,  /,  X)  =  \(J^dx\j/  +  Oiq(x,  t)i/(x ,  /,  X)  =  ki/(x,  /,  X),  (68) 

where  the  operator  £  is  a  self-adjoint  one,  so  its  spectrum  must  be  on  the  real  X-axis  But  the 
continuous  spectrum  of  C  fills  up  the  whole  real  X-axis,  which  leaves  no  room  for  solitons. 
Finally,  the  Zakharov-Shabat  system  can  be  restricted  by  a  third  involution,  e.g., 

q(x,  r)  =  -qT(x,  t),  i  e.  q+  =  -q~  =  (69) 

Such  involution  is  compatible  only  with  those  NLEE  whose  dispersion  law  is  odd  function 
/ (X)  =  — / (—X).  Therefore  it  cannot  be  applied  to  the  NLS  equation;  applied  to  the  MKdV 
equation  it  gives 

wxt  +  wxxxx  +  6(wx (jc,  t))2wxx  =  0,  (70) 

which  can  be  integrated  ones  with  the  result  v  =  wx% 

Vt  +  Vxxx +b(v(x,t))2Vx  =  0,  (71) 

i.e.  we  get  the  MKdV  equation  for  the  real-valued  function  u(x,  /).  It  is  well  known  also  that 
the  NLEE  with  dispersion  law  / (X)  =  (2X)_I  can  be  explicitly  derived  under  this  reduction 
and  comes  out  to  be  the  famous  sine-Gordon  equation  [25] 

wxt  +  s'\n(2w(x>  /))  =  0.  (72) 

This  second  involution  can  be  imposed  together  with  the  one  in  (60).  The  restrictions  that 
it  imposes  on  the  scattering  data  are  as  follows: 

a+(k)  =  (a~{k*))\  a+(k)  =  (a~(-k)).  (73) 

Now  if  X+  is  an  eigenvalue  of  L(k)  then  (X+)*,  — X+  and  —  (X+)*  must  also  be  eigenvalues. 
This  means  that  wc  can  have  two  configurations  of  eigenvalues: 

(i)  pairs  of  purely  imaginary  eigenvalues 

X+  =  \v  =  — (X+)\  X-  =  —it;  =  — (X~)*:  (74) 


(ii)  quadruplets  of  complex  eigenvalues 

X+  =  /x  +  iu  — (X+)*  =  —n+  iu, 

X-  =  (i  —  it’,  —(X-)*  =  —fi  —  iu. 


(75) 


Thus  we  conclude,  that  the  sine-Gordon  and  MKdV  equations  allow  mv?  types  of  solitons: 
type  1  with  purely  imaginary  pairs  of  eigenvalues  and  type  2  each  corresponding  to  a  quadruplet 
of  eigenvalues.  Type  1  solitons  are  known  also  as  topological  solitons,  or  kinks  (for  details  see 
[2]).  They  are  parametrized  by  two  real  parameters:  v  and  |C+|  so  they  have  just  one  degree 
of  freedom  corresponding  to  the  uniform  motion 

Type  2  solitons  are  known  as  the  breathers  and  are  parametrized  by  four  real  parameters: 
p  and  v  and  the  real  and  imaginary  parts  of  C+.  Therefore  they  have  two  degrees  of  freedom: 
one  corresponds  to  the  uniform  motion  and  the  second  one  describes  the  internal  degree  of 
freedom  responsible  for  the  ‘breathing’. 

The  purpose  of  presenting  the  above  well-known  facts  in  the  above  manner  was  simply  to 
make  it  clear  that  the  structure,  as  well  as  the  number  of  related  parameters  which  determine 
what  different  types  of  solitons  can  exist,  depend  strongly  on  the  type  of,  and  the  number  of, 
different  involutions  that  can  be  imposed  on  the  system. 
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3.  The  Generalized  Zakharov-Shabat  system  and  sl(n)  solitons 

3.1  N-wave  system  related  to  si ( 3) 

In  this  subsection,  we  start  with  the  generic  /V-wave  system  related  to  sl( 3)  and  analyze  its 
reductions.  The  one-soliton  solutions  for  the  generic  case  and  for  the  typical  reductions  are 
well  known  [9,  10].  Next  we  impose  Z2  and  Z2  x  Z2  reductions  and  derive  the  corresponding 
one-soliton  solutions.  Below  we  will  use  a  notation  which  exploits  the  root  structure  of  sl(3), 
namely  0*„/c ,  n  =  1,2  stands  for  the  component  of  Q  associated  with  the  root  a  =  kai+  na 2 
expanded  over  the  simple  roots  a 1  =  e\  —  e2  and  a2  =  e2  —  e 3.  Taking  into  account  that 
convention  the  generic  N- wave  system  for  sl( 3)  consists  of  six  equations;  the  first  three  of 
them  are  given  by 


i(7i  —  Ji)Q  10./  —  i(/i  —  4)010.*  +  3&£>n  =  0,  (76) 

i(7]  —  —  i(/i  —  4)0i  1.*  —  3fc0io0oi  =  0,  (77) 

i(72  —  ^3)001./  —  H4  —  4)0oi,*  +3kQwQn  =  0;  (78) 


the  other  three  equations  are  obtained  from  (76)  by  replacing  J *  <-►  —  7*,  /*  «-►  — /*  and 
Qkn  +*  Qjj;-  We  recall  that  *4  =  0*  4  =  0  and  k  —  J{I2  —  I\J2.  This  system 

can  be  solved  via  a  standard  dressing  procedure  [7,  9]  with  the  dressing  factor  (27).  The 
one-soliton  solution  obtained  that  way  is  given  by  the  following  expressions: 

Slot*,  t)  =  - 

(wi|n> 

Cn(jf.O  =  A'  ~  a,H(um0.3.  (79) 

{m\n) 

Qoi(x,t)  =  -  -7  ;Vi(,.2"io.3. 

(»i|n) 

where  3a  =  Jax  +  Iat  and 

3 

(m\n)  =  )Zin0.jmoj.  (80) 

7-1 

The  expressions  for  the  other  three  fields  can  be  derived  from  these  by  executing  the  following 
changes  Qkn  **  ++  e~'Xz\n0j  **  m0,j. 

Next  we  consider  the  typical  Z2  reductions  of  the  type 

KlUHx.WKi1  =  U(x,X),  =>  KiJ'K;'  =  J,  K\Q'k;'  =  -Q.  (81) 

where  K  \  =  diag(ci ,  ei,  63),  ej  =  1,  is  an  element  of  the  Cartan  subgroup  H  C  SL(  3)  which 
represents  an  action  of  Z2.  This  results  in  reducing  the  number  of  independent  fields  since  we 
have 


Qm  =  <2tt= Qm  =  -^-iQrn 

and  therefore  the  number  of  equations  reduces  from  6  to  3  as  follows: 
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i(7i  —  Ji)Qw%t  —  i(/i  —  4)010.*  ~  3^C2^3Gii0oi  = 
i(7i  —  73)011./  —  i(4  —  4)011.*  —  3^0io0oi  =  0, 
i(*4  —  73)  0oi./  —  i  (4  —  4)0oi.*  —  3^6i620io0h  =  0. 


(82) 

(83) 

(84) 
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The  discrete  eigenvalues  of  Z2-reduced  operator  L  are  complex  conjugated,  i.e.  X~  = 
(X+)*  =  11  —  iu  and  the  polarization  vectors  are  interrelated  via  |/i)  =  K\\w)*.  The  one- 
soliton  solution  in  this  case  is 

2iuev(z,+Z2) 

=  (85) 

=  (86) 

2i  y  e— 

001  (X,  t)  = - -  e-,^(Z2_Z,)Mo  2^3^0  3-  (87) 

*  1  <n*|K,|/j)  *  o*3 


Remark  3.  In  general,  the  denominator  (80)  of  the  expressions  for  the  one-soliton  solution 
can  possesses  zeros  for  some  x  and  r,  i.e  we  have  singular  solitons.  The  same  holds  true 
also  for  the  3-wave  system  with  the  typical  involutions  for  which  K  ^  A.  This  is  directly 
related  to  the  well-known  ‘blow-up’  instability  [9].  For  the  typical  involution  with  K\  =  i 
the  corresponding  denominator  is  a  sum  real-valued  exponentials  multiplied  by  some  positive 
factors  which  is  always  positive. 

By  imposing  another  Z2  reduction  on  the  potential  U(x ,  A),  namely 
K2UT(x,-\)K2'  =-U( x.X),  K2JtK^=J,  K2QtK2'  =  Q,  (88) 

where  K2  e  H  satisfies  [Kf%  K 2]  =  0  we  obtain  a  Z2  x  Z2-reduccd  s/(3)/V-wave  system. 
As  a  consequence  we  have  a  pair  of  purely  imaginary  eigenvalues  X±  =  ±ie.  Choosing 
K\  =  K2  =  1  we  see  that  the  three  independent  fields  Q\o(x%t),  Qoi(x,t)  and  Q\\(x,t) 
are  purely  imaginary  while  the  polarization  vector  is  real,  | n)*  =  |w).  After  introducing  new 
variables 

G10U.O  =  iqio  (x.t),  Qoiix.t)  =  iq()i(xj),  Qi\(x,t)  =  iqn(.v,r), 
we  derive  a  real  3-wave  system  for  three  real-valued  fields 

(J  1  —  *^2)q  10. r  —  (h  —  h)quu  +  3/:qnqoi  =  0, 

(A  —  J*)qiu  ~  (/1  —  /3)qi l.x  —  3/:qioqoi  =  0,  (89) 

(A  —  /Oqou  —  (h  —  h)  qoi,x  +  3/:qioqn  =  0 

Since  the  dressing  factor  must  satisfy  the  conditions 

(Mt(jcfx*))’1  =m(x,  X),  (90) 

(wr(jc,-X))-‘  =  W(J c,  A),  (91) 


the  projector  P  is  real  valued.  In  this  case  the  discrete  eigenvalues  are  purely  imaginary,  i.e. 
=  iriu.  The  one-soliton  solution  is 


q«(*)  =  -2vPk/(x), 


p  = 


('ll")  ’ 


kjil. 


Taking  into  account  that  |«>  =  ev',‘|/io)  we  derive  explicitly  the  following  result: 


qio(*,f)  =  - 
qn(*.f)  =  - 

=  - 


_ 2uel,Ul+a)n0.iHo.2 _ 

eln'nlA  +e2vz''i^2+e-2w<z'+z'»n^3’ 

_ 2ve~VZ3n0  i/ip.3 _ 

+  e2vZ3r>o.2 +  e~Mz'*Z2)nl.i  ’ 

_ 2ue~1’Zl>to,2'io,3 _ 

e2wz<nj,  +e2vZj'i^2  +e-2l,(z>+Z2,/i(23' 


(92) 
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In  the  Z2  x  Z2  case  there  exists  another  type  of  soliton  solutions — these  obtained  by  using 


a  dressing  factor  of  the  form 

u(x.  X)  =  1  +  (c(X)  —  1)A(jc)  + 

while  its  inverse  reads 

\  x-x+ 

1  )Bix)-  C(X)=X-X-' 

(93) 

u~l(x,  X)  =  1  +  | 

!sr') 

I  C(x)  +  (c(— A.)  —  \)D(x). 

(94) 

These  solutions  are  associated  with  four  discrete  eigenvalues  of  the  scattering  operator  L : 
±X±.  In  this  sense  they  may  be  called  quadruplet  solitons  unlike  the  solutions  (92)  which 
being  associated  with  two  eigenvalues  ±iv  represent  doublet  solitons.  The  dressing  factor 
(93)  is  Z2  x  Z2  invariant  if  the  conditions  hold  true 

K\(u^(x,  X*))-1  K^x  =  u(x,  X),  (95) 

K2{uT{x ,  -X))"1  K 2_l  =  u(x,  X).  (96) 


Hence  the  following  relations  hold: 

B  =  K}K2A*K{lK^\  C  =  KlAiK^\  D  =  K2AT  (97) 


as  well  as  X  =  (X*)*  =  /t  —  iv. 

The  matrix-valued  function  A  admits  the  decomposition 

A(x)  =  X(x)FT  (x).  (98) 

By  using  the  equality  uu~l  =  1  one  can  prove  that  the  factor  X(jc)  can  be  expressed  by  F(x) 
as  follows: 

*=  -= - rrxlaKiF'-b'KzF).  (99) 

a2  —  \b\2 

where 

,  i  vFt  K2F  t  t  . 

a  =  F'K,F.  b  = - FT(x)  =  Fil(xri(x,k-))-'.  (100) 

fj.  -  \v 

To  find  the  one-soliton  solution  we  take  the  limit  X  — ►  oc  in  (23)  and  put  qo  =  0.  Thus  we 
obtain  the  following  formula: 

Q)\  =  (X-  -X*)(A  +  KlK1AmK1Kt)Jk.  j?k.  (101) 

Let  K\  =  K2  =  I.  Then  Q*  =  —  Q  and  using  the  above  notation  we  have  for  the  one-soliton 
solution: 


4  3  f 

qio  =  -— /«  Y  /**  \ C0SM  - te) 
*-1  l 


vcos(2 4>k  -4>f  -<p2+<t>o) 


+  vz 


(102) 


qn 


kml 


COS(0|  -  03)  - 


VCOS(20,t  -  01  -  03  +  0p) 


(103) 


4v 


qoi 


--rfxYfkk 


*-l 


COS(02  -fo)~ 


ucos(2 (pk  ~(p2  -  03  +  0o) 


+ 


A  =  a2-\b\2  =  ^— - 

//*  =  I  Fq'J  F0'k  |  el,(z'+z*\ 
where  X+  =  i^//2  +  u2  e'^°. 


f1  Y  & +2Y  + 21,2  sin2(^  -  ^)i 

*=i  it<p 

0*0*.  f)  =  nzk  +  h.  Sk  =  arg  FM, 


(104) 
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3.2.  N-wave  systetns  related  to  si (n),n  >  3 

For  the  sake  of  simplicity  and  clarity  below,  most  of  our  discussions  will  be  restricted  to  the 
case  n  =  5  and  rank-1  projectors  P.  They  also  could  easily  be  reformulated  for  any  other 
chosen  value  of  n  and  for  higher  rank  projectors.  The  corresponding  Lax  operator  L(X)  which 
is  a  particular  case  of  (3)  with 

L  =  \dx  +  U(x,  /,  X)  =  \dx  +  [J,  Q0 t,  /)]  -  XJ. 


J  =  diag(Ji,  J2-,  J4,  J$), 


Q(x,t)  = 


0 

012 

0 13 

014 

0u\ 

021 

0 

023 

024 

025 

03. 

032 

0 

034 

0.5 

041 

042 

043 

0 

045 

\0s. 

0  32 

053 

054 

0  / 

(106) 


(107) 


Furthermore,  for  definiteness  we  will  assume  that 

tr  J  =  0,  J\  >  Jj  >  J3  >  0,  0  >  y4  >  Js,. 

The  A/-operator  in  the  Lax  representation  for  the  N-wave  equation  (10)  is  given  by 
jV/i/r(jt,  /,  X)  =  i d[\l/  +  ([/,  Q(x,  /)]  —  XI)\f/(x,  /,  X)  =  — Xtyix,  /,  A)/, 

where  /  =  diag(/i . Is)  is  a  traceless  matrix.  The  generic  one-soliton  solution  can  be 

derived  by  using  (32), 

q(x)=  litn  k(J  —u(x,  k)Ju  '(x,X))  =-(X* -X~)[J,  P{x)].  (108) 

k-*oo 

with  a  generic  projector  P  whose  rank  s  can  be  bigger  than  I , 

5 

P(x,t)=  ^  \na(x.t))M~^(nl(x.t)\.  Mah{x.i)  =  (n'h(x,t)\na(x,t)), 

a.b=  1 

I "aU.  t ))  =  Xq  (X,  I ,  A.+)|/I0.a),  {/lO.a  l^()!«0.*>  =  0. 

Note  that  the  set  of  s  linearly  independent  polarization  vectors  |/i*)  determine  the 
corresponding  eigensubspace  of  L.  Such  subspace  can  be  defined  either  as  the  image  of  P 
or  as  the  kernel  of  the  projector  P  =  1  —  P  which  is  defined  by  a  complimentary  set  of  n  ~  s 
polarization  vectors.  Therefore  studying  s/(/7)-type  Zakharov-Shabat  systems  it  is  enough  to 
analyze  projectors  of  rank  s  ^  [n/2],  where  [n/2]  is  the  entire  part  of  n/2.  Thus  for  n  =  3  it 
is  enough  to  study  rank-1  projectors,  while  for  n  =  5  one  needs  also  rank-2  projectors. 

For  s  =  2  we  have  two  linearly  independent  polarization  vectors  | na).a  =  1,2  and  from 
(109)  we  get 


(109) 


P(X  y  t)  = 


det  M 


(|fl|C*.  r))M22[n\(x.  /)|  -  | n2(x.  t))M]2(n\(x,t)\ 


-  |«i(jr,  t))M2\{n[(x,  /)|  +  |n2(Jr.  t))Mu[n\(x ,  0|). 
det  M(x,t)  =  M,|M22  -  M|2M2i,  Mah(x,t)  =  {nl{x,  /)|«*(-V.  t)). 

Let  first  concentrate  on  rank- 1  projectors,  see  (93)  with 

\n(x))  =  Xo(x^>^)\no),  (m(x)\  =  (m0 |x0"(.v,  A"). 


(HO) 


(HI) 


The  polarization  vectors  \no)  and  (mol  are  constant  5-vectors.  Thus  these  type  of  one-soliton 
solutions  is  parametrized  by: 

(i)  The  discrete  eigenvalues  X±  =  /i^db  iu*; /x*  determine  the  soli  ton  velocity,  v±  determine 
the  amplitude. 
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(ii)  The  ‘polarization’  vectors  \rio ),  I  parametnze  the  internal  degrees  of  freedom  of  the 

soliton.  Note  that  P(x)  is  invariant  under  the  scaling  of  each  of  these  vectors.  Genetically 
each  ‘polarization’  has  five  components,  one  of  which  can  be  fixed,  say  to  1.  So  each 
‘polarization’  is  determined  by  four  independent  complex  parameters 
We  have  several  options  that  will  lead  to  different  types  of  solitons: 

(1)  generic  case  when  all  components  of  |«o)  are  non-vanishing; 

(2)  several  special  subcases  when  one  (or  several)  of  these  components  vanish.  The 
corresponding  solitons  will  have  different  structures  and  properties. 

For  the  generic  choice  of  \tiq)  one  finds 

lim  P(x,t)  =  P± ,  P+  =  £ , , ,  P-  —  (112) 

.r-*±oo 

where  the  matrix  Ekj  has  only  one  non-vanishing  matrix  element  equal  to  1  at  position  /c,  y\ 
i.e.  (Ekj)mp  =  &km&jp-  Therefore  both  the  limiting  values  u±(k)  and  their  inverse  u±(k)  are 
diagonal  matrices 

u+(k)  =  diag(c(A),  1,1 . 1),  U-(X)  =  diag(K  I . l,c(A.)).  (1 13) 

From  (25)  forn  =  5  we  have 


fiyW  =  c(A)(70),y(A).  j  =  1, 2,3,4; 

7)5 w  =  (%W/CW,  j  =  2,  3, 4,  5; 

T,’j(k)  =  (Tu)jj(X),  for  all  other  values  of  i,  j. 


(114) 

(115) 


This  relation  allows  us  to  derive  the  interrelations  between  the  Gauss  factors  of  7o(A)  and 
T (k).  In  particular  we  find  for  the  principal  minors  of  T (A.), 

«*(«  =  ca)m^(X),  mk(k)  =  m^k(k)/c(k),  (116) 

where  mJ(A)  (resp.  tnk(k))  are  the  upper  (resp.  lower)  pnncipal  minors  of  T(k).  Since 
X(f  (jc,  t.k)  are  regular  solutions  of  the  RHP  then  fn^k(k)  have  no  zeroes  at  all,  but  (1 16) 
means  all  mf(k)  have  a  simple  zero  at  k  =  X±. 

The  generic  one-soliton  solution  then  is  obtained  by  taking  x±(jc,  /,  k)  =  exp(— \kJx). 
As  a  result  we  get 

(P(X,  t))ks  =  ■■  .  e~l<x<z*~x  z,).  (117) 

k(x.t ) 

n 

k{x,t)  =  ^no./,»«o.pe"i<r-r)z'u,'>.  (118) 

P=  1 

Zp(x.l)  =  Jpx  +  lpt,  qll  =  -(k* -X")(  P(x,t))k„  (119) 


i.e.  in  all  channels  we  have  nontrivial  waves.  The  number  of  internal  degrees  of  freedom  is 
2 (n  —  1)  =  8.  Note  that  the  denominator  k(x ,  /)  is  a  linear  combination  of  exponentials  with 
complex  arguments,  so  it  could  vanish  for  certain  values  of  x,  t.  Thus  the  generic  soliton  (117) 
in  this  case  is  a  singular  solution. 

Next  we  impose  on  U{x,t ,  k)  the  involution 

KU'(x.t.k*)K-'  =  U(x,t,k),  K  =  diag(«i . e„),  (120) 

with  €j  =  ±1.  More  specifically  this  means  that 

Kq\x,  t)K~'  =q(x,t ),  K  u](x,  t,  k*)K~]  =  w-i(jc,  t,  X),  (121) 
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and 

k+  =  (k-)'  =  n+\v.  <m„|  =  (£K»t. 

Thus  only  |«o)  is  independent. 

Then  the  one-soliton  solution  simplihes  to 


^ls/  _  ^lv(Jk  Js)  *  rv(zk+zt)  r-W(Zk-Z,) 

w^Tj  6in°*"0ie 


krriix.t)  = 

P=l 


(122) 


(123) 

(124) 


The  number  of  internal  degrees  of  freedom  now  is  n  —  1  =4.  If  one  or  more  of  €j  are  different, 
then  this  reduced  soliton  may  still  have  singularities.  The  singularities  are  absent  only  if  all 
e j  are  equal 

The  analysis  of  solitons  obtained  with  rank-2  projectors  should  go  along  the  same  lines. 
It  is  lengthier  than  the  one  above  and  we  may  omit  it.  We  just  note  that  even  with  the  canonical 
reduction  with  K  =  J  in  (121),  one  cannot  guarantee  that  det  M  >  0  for  all  x  and  /  (see  (1 10)) 
which  means  that  one  may  encounter  singular  solitons. 


3.3.  Typical  si (2)  solitons 


The  hnite-dimensional  irreducible  representations  of  the  algebra  5/(2)  are  labeled  by  their  spin 
J  and  have  dimension  27  +  1;  the  spin  J  can  take  half-integer  positive  values.  The  simplest 
representation  with  7  =  2  is  two  dimensional  and  is  known  also  as  the  typical  representation  of 
5/(2).  In  this  subsection,  we  will  address  first  the  simplest  possibility  when  5/(2)  is  embedded 
into  5/(5)  via  its  typical  representation;  we  will  call  such  solitons  typical  5/(2)  solitons 

From  now  on  we  assume  that  the  reduction  (120)  with  ep  =  1  holds.  Here  |a?0.i)  has 
only  two  non-vanishing  components.  We  consider  here  three  examples  with  n  =  5  and  three 
different  choices  for  the  polarization  vectors 


( 0  \ 

f"0.1> 

0 

"0.2 

"0.2 

0 

(b)  |«o)  = 

0 

(c)  bio)  = 

0 

0 

"0.4 

0 

VI  0.5/ 

V  0  j 

V  0  / 

(125) 


In  all  these  cases  the  corresponding  one-soliton  solutions  <7  (jc,  /)  are  given  by  similar  analytic 
expressions,  each  having  only  two  non-vanishing  matrix  elements 

j v(J.  —  J.)  ei<*rg('*o./)-arg(n(u»  Q-w(Jj-JkHx+v’jkn 

qjk(x,  1)  =  (qjk(x,t)y  = - /  —  — - : — r-j - i — Ti - Tr  ^126) 

cosh[v(yy  -  Jk)(x  +  wjkt )  +  lnK.yl  -  InKjtl] 

where  we  recall  that  Wjk  =  (Ij  —  h)/Uj  —  7*),  j  <  k.  For  case  (a)  we  have  j  =  \%k  =  5; 

in  case  (b):  j  =  2,  k  =  4  and  in  case  (c)  j  =  1  and  k  =  2. 

The  5/(2)  soliton  is  very  much  like  the  NLS  soliton  (apart  from  the  /-dependence);  the 

NLS  soliton  has  only  one  internal  degree  of  freedom. 

The  different  choices  for  the  polarization  vector  result  in  different  asymptotics  for  the 

projector  Pi(x), 


(a) 

lim  P{x)  =  £n. 

lim 

P(X)  = 

£55. 

.1— ►oo 

.K— -OO 

(b) 

lim  P{x)  ~  E22, 

lim 

P(x)  = 

£44. 

X—*  OO 

.r-»-oo 

(c) 

lim  P(x)  =  £n. 

lim 

P(x)  = 

£22. 

.r— »oc  x  —*  —  oo 
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In  case  (a)  the  results  for  the  limits  of  P(jc)  and  for  w±(A)  are  the  same  as  for  the  generic 
case,  see  (112)  and  (113).  Asa  consequence,  such  5/(2)  solitons  require  the  vanishing  of  all 
Evans  functions  mf( A)  for  A  =  X±,  see  (116). 

In  case  (b)  from  (25)  and  from  the  appendices  we  get  that  such  5/(2)  soliton  provides  for 
the  vanishing  of  mf  (A)  and  mf  (X), 

ifijtt)  =  c(A)<2(A), 

mjW  =  Jftoa2(A.)/c(  A), 

whereas  mf  (A.)  =  mf  {(X)  and  mf  (A)  =  mf4(A)  remain  regular  and  do  not  have  zeros  at 
X  =  X±. 

Likewise  in  case  (c)  we  get  that  only  m|(A)  and  m4  (A)  acquire  zeroes, 

m|(A)  =  c(A)mJ  ,(A)f  m-(A)  =  «io.4(A)/c(A),  (129) 

and  all  the  other  Evans  functions  mj( A)  with  j  =  2,  3,  4,  and  m‘(A)  with  p  =  1, 2,  3  do  not 
have  zeroes. 


/?I3  (A)  =  C“(A)??Iq  3  (A) , 
m^  (A)  =  mo  3(A)/c(A), 


(128) 


3.4.  Higher  spin  J  sl{ 2)  solitons 

Here  we  provide  some  examples  of  5/(2)  soliton  solutions  which  are  embedded  into  5/(5) 
via  a  higher  (2 J  +  1  )-dimensional  representation  of  5/(2)  which  we  call  spin  J  5/(2)  solitons. 
Obviously  for  g  —  5/(5)  the  ‘spin’  of  the  solitons  can  take  values  J  =  1 , 3/2  and  2. 

It  is  well  known  that  spin  J  representation  can  be  constructed  using  the  completely 
symmetrized  tensor  products  of  the  typical  2x2  representation.  The  details  of  their  derivations 
are  given  in  appendix  B  Here  we  briefly  formulate  the  results. 

Before  starting  with  to  calculate  symmetrized  tensor  products  of  the  dressing  factor  we 
have  to  make  a  slight  modification  so  that  it  takes  values  in  the  group  SL( 2).  Indeed,  as  it  is 
given  by  (27)  and  (65)  one  easily  checks  that  detn(jc.  A)  =  c(X).  Therefore  we  multiply  it  by 
the  constant  1  / ■yJc(X)  factor  so  that  its  determinant  equals  1 .  Thus  we  can  rewrite  the  dressing 
factor  for  the  typical  5/(2)  soliton  in  the  form 

u(x,X)  =  JcWP(x)  +  -X=P(x),  (130) 


where  P 


■■  1  —  P\  in  terms  of  the  polarization  vectors  they  take  the  form 
1  / n\tn\  n\m2\  -p  _ 1  /  n2m2  -npn2} 

>i2mi 


/  n2m2  -npn2\ 
\-n2m  1  nimi  )' 


(131) 


n\m\  +  n2m2  \n2*n\  n2m2)  /ijmi  +^2^2  V- ^ n2mi  n\ni\ 

After  some  calculations  we  get  the  following  results  for  the  higher  spin  dressing  factors: 


L/(3)  =  U  O  U  —  ^A)^.1’'  +  7TnJ'  +  - 71 

1  0  c(A) 


(3) 


(3) 


1 


(3) 


u 


(4) 


3/2 

=  U  O  U  O  U  =  ^  7T, 

/=- 3/2 


(4  )J 


c  (A), 


(132) 


U 


(5) 


=  uQuQuQu  =  ^  7r/5V(A). 


l=-2 


By  u  O  u ,  u  O  u  O  u ,  etc  we  have  denoted  the  completely  symmetrized  part  of  the  corresponding 
tensor  powers  of  the  dressing  factor  w(-*.  A),  the  superscript  k  of  (J(k)  denotes  the  dimension 
27  +  1  of  the  representation.  By  7T^k)  we  have  denoted  mutually  orthogonal  rank-1  projectors 

mm 

7Xq  1 Tfo  — oafj7la  .  (133) 


r(*)  _ 
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The  explicit  expressions  for  the  vectors  | N^k))  and  (M^k)\  in  terms  of  n ,*  and  nij  are  given  in 
appendix  A,  Here  we  just  mention  that 

=  {Nkk)\S(k\  (134) 

where  the  matrices  S^k) 


r<*)  _ 
°0  “ 


5=0 

(k  —  l)/2 

E  -  Em-J 

5=0 


for  &  =  2p  +  1 
for  k  =  2p 


(135) 


are  the  ones  used  to  define  the  orthogonal  algebras  so(k). 

All  these  properties  of  the  projectors  7i^k)  allow  us  to  rewrite  the  dressing  factors  in  the 
form 


U(k\x,X) 

for  odd  values  of  k  and 
Uik)(x,k) 


(k- 1)/2 

exp  j  In c(X)  ^  (jr***  —  x _s)  +  jt{ 

5=1 


exp 


(  a-o/2 

lnc(X)  E 

(  5=1/2 


(136) 


(137) 


for  even  values  of  k.  It  is  important  to  note  that  due  to  ( 1 34)  the  differences  7i$k)  —  jri**  €  so(k) 
and  also  7i^k)  €  so(/c). 

Let  us  now  outline  how  one  can  construct  spin  3/2  soliton  of  the  A-wave  equations  related 
to  sl( 5).  First  we  have  to  embed  the  dressing  factor  Ui4)  which  is  a  4  x  4  matrix  into  a  5  x  5 
dressing  factor.  This  can  be  done  in  several  inequivalent  ways,  which  reflects  the  fact  that 
the  group  50(4)  can  be  embedded  into  the  group  SL( 5)  in  different  ways.  As  first  of  them 
we  choose  the  following  one.  First  we  extend  the  four-component  vectors  N^k))  into  five- 
component  ones  |N^})  =  ((N^,  0)r.  We  will  need  also  the  vector  |e5)  =  (0,0, 0,0,  \)T . 
Then  we  can  construct  the  5  x  5  dressing  factor 

U,4)  =  n(*y2c~i/2(X)  +  7t(*l/2c~>/2(k)  +  jt,<4Jc,/2(A.)  +  T34jC3/2(X),  (138) 

where 


71 


(4)  = 
a 


1N14>)(M14)| 

K'K1)  ’ 


,(4) 


KK1 

KK>) 


+  |e5>(e5|. 


(139) 


In  order  to  calculate  the  corresponding  soliton  solution  it  remains  to  insert  U(5)  as  u(x,  X) 
into  (108).  For  Q{ o)  =  Othe  result  will  be 


/  0 

v/3<7+ 

0 

0 

o\ 

V^q~ 

0 

V 

0 

0 

0 

2q~ 

0 

y/3q+ 

0 

(140) 

0 

0 

V3  q~ 

0 

0 

V  0 

0 

0 

0 

o) 

where  the  functions  q±(x ,  t)  are  given  by  (59). 

It  is  natural  to  analyze  the  structure  of  the  discrete  eigenvalues  X±  corresponding  to  these 
types  of  soliton  solutions.  The  discrete  eigenvalues  of  L  are  the  zeroes  of  the  principal  minors 
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(141) 


tnf(X)  of  the  scattering  matrix  T(X).  To  this  end  we  have  to  calculate  the  diagonal  factors 
D±( X)  of  the  Gauss  decomposition  of  T {X)  The  result  is 

m|(X)  =  c3(A)<,  (X).  mjtt)  =  c4(X)m^.2(X). 

ml(X)  =  c3(X)m£3(X).  mJ(X)  =  m*4(X). 

Comparing  with  (129)  we  conclude  that  going  to  higher  representations  of  5/(2)  leads  to 
multiple  zeroes  of  some  of  the  principle  minors  of  T(X).  Therefore  the  resolvent  of  L  will 
have  poles  of  higher  order  at  X± ,  though  the  residues  at  this  points  will  be  expressed  by  the 
same  polarization  vectors  as  for  the  ^-spin  solitons.  One  consequence  of  this  construction  is 
that  the  higher  spin  5/(2)  solitons  have  the  two  degrees  of  freedom  as  the  standard  (spin  ^) 
solitons. 


3.5.  Typical  si  (3)  -solitons 

Here  |/i0)  has  three  non-vanishing  components, 
polarization  vectors 


^0.1^ 

(  0  \ 

0 

'h).2 

>hi2 

(a)  |/io)  = 

no, 3 

(b)  |«0)  = 

'10. 3 

(C)  |/lo>  = 

"0.3 

0 

'*0.4 

0 

VI0.5/ 

0  ) 

\o/ 

We  consider  three  examples  of  such 


(142) 


Therefore  the  5/(3)-solitons  have  two  internal  degrees  of  freedom. 
The  asymptotics  of  the  projector  P(x)  read  as  follows: 

(a)  lim  P(jr)  =  E\\%  lim  P(x )  =  £55, 

.*—*•00  x-*-oo 

(b)  lim  P(x)  =  £22,  lim  P(x)  =  £44. 

Jf-*00  .t— *— 00 

(c)  lim  P(x)  =  £u,  lim  P(x)  =  £33. 


(143) 


Note  that  cases  (a)  and  (b)  in  (143)  coincide  with  the  corresponding  cases  in  (125). 
Therefore  the  set  of  Evans  functions  that  acquire  zeroes  will  be  the  same  as  for  the 
corresponding  5/(2)  solitons.  In  case  (c)  of  (143)  we  have 

mj(X)  =  c(X)m^  , (X),  mJW  =c(X)m^2(X), 

(144) 

m4  (X)  =  m0  A(X)/cX),  m3  ( X )  =  m0  3(X)/c{X), 

whereas  the  remaining  Evans  functions  m'j(X)  with  j  =  3, 4,  and  m“(X)  with  p  =  1,2  remain 
regular. 

In  case  (a)  the  corresponding  one-soliton  solutions  acquire  the  form 


/0 

0 

<713 

0 

<?15\ 

(° 

0 

0 

0 

°\ 

0 

0 

0 

0 

0 

0 

0 

<723 

<724 

0 

(a)  q1’ 

‘  = 

Q\3 

0 

0 

0 

<735 

(b)  q"  = 

0 

<723 

0 

<734 

0 

0 

0 

0 

0 

0 

0 

<7  24 

<7  34 

0 

0 

Ws 

0 

«35 

0 

0  / 

Vo 

0 

0 

0 

0/ 

(° 

<?12 

<723 

0 

°\ 

<lh 

0 

<723 

0 

0 

(c)  q" 

'  = 

^23 

0 

0 

0 

0 

0 

0 

0 

0 

V  0 

0 

0 

0 

0) 

(145) 
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where  the  matrix  elements  t)  are  given  by 

i v(Jk  ~  Js)€s^Ot+J')(x*Vl'nna.kir0s  e-mUt-J,)(x*w„n 

***'  '  '  |«o.i  I2  c2*'<-,>jr+,,/>  +  |/io.3l2  e2l,<'^Jr+^»,>  +  |«o.5l2 e2*'<-/sX+/;,,>  * 


(146) 


and 

h  =  h  —  U\  +  ^3  +  ^s)/3,  h  =  h  —  ( I\  +  h  +  /s)/3,  i’jtt  =  y — w-.  (147) 

Ik  +  Is 

This  soliton  has  two  internal  degrees  of  freedom  and  is  regular. 

Obviously  it  is  by  now  clear  how  one  can  write  more  complicated  solitons  like  5/(4) 
which  would  be  characterized  by  the  polarization  vectors  of  the  form 


Mu\ 

fao.l> 

'10.2 

"0.2 

"0.3 

.  (b)  |/i<>)  = 

"0.3 

"0.4 

0 

Vo) 

V"0,5/ 

(148) 


The  5/ (4) -solitons  will  have  three  internal  degrees  of  freedom. 

We  note  here  that  due  to  our  choice  of  J  in  (106),  s/(4)-solitons  cannot  give  rise  to 
generalized  eigenfunctions 

Of  course  one  could  try  to  embed  5/(3)  into  g  using  one  of  its  higher  dimensional 
representations.  The  algebra  5/(3)  has  a  second  three-dimensional  representation  which  is 
obtained  related  to  the  typical  one  used  above  by  an  exterior  automorphism.  The  corresponding 
soliton  solutions  are  obtained  from  those  demonstrated  above  by  a  re-parameterization  of 
the  polarization  vectors.  The  next  irreducible  representation  of  5/(3)  which  is  the  adjoint 
representation,  has  dimension  8,  so  it  cannot  be  embedded  into  5/(5).  Of  course,  considering 
algebras  g  of  rank  high  enough  it  is  possible  to  embed  subalgebras  go  using  their  non-typical 
representations. 


4.  Eigenfunctions  and  eigensubs  paces 


The  structure  of  these  eigensubspaces  and  the  corresponding  solitons  becomes  more 
complicated  with  the  growth  of  n. 

In  what  follows  we  start  with  the  generic  case  and  split  the  ‘polarization'  vector  into  two 
parts 


/"0.l\ 

/  0  \ 

"0,2 

0 

l"o>  =  I/*)}  +  M<»);  Ipo)  = 

"0,3 

\da)  = 

0 

0 

"0.4 

V  o  ) 

\"0.5/ 

and  therefore 


(149) 


In)  =  I P)  +  I d),  | p)  =  Xo+U.  *+)l Pol  \d)  =  X0+(*<  *+)Mi>.  (150) 

This  splitting  is  compatible  with  (106)  and  has  the  advantage,  if  X(|U<  ^+)  =  exp(— iX+ijc) 
then  | p)  increases  exponentially  for  x  — ►  oc  and  decreases  exponentially  for  jc  — ►  —  oo; 
| d)  decreases  exponentially  for  x  — ►  oc  and  increases  exponentially  for  x  — ►  — oc,  see  also 
lemma  I 
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What  we  will  prove  below  is  that  one  can  take  a  special  linear  combination  of  the  columns 
of  Xq  (jc,  X+)  which  decreases  exponentially  for  both  x  — ►  oc  and  x  — ►  —  oo.  Doing  this  we 
will  use  the  fact  that 

X+(*.*+)l«o>  =  (l  -  P(Jc))x+(JT,  A+)|no>  =  d  -  P(x))|n(jr)>  =  0.  (151) 


Lemma  1.  The  eigenfunctions  of  L  provided  by 

f+(x)  =  X+(x,\+)\pQ)  =  -x+(*.*+)M>>.  (152) 

decrease  exponentially  for  both  x  — ►  oo  and  x  — ►  —  oo. 


Proof.  From  (151)  and  (149)  there  follows  that  both  expressions  for  /+(a,  t)  coincide,  so  we 
can  use  each  of  them  to  our  advantage,  see  (152)  We  will  use  also  the  fact  that  1  —  P(jc)  is  a 
bounded  function  of  jc. 

We  start  with 

lim  /+( x)=  lim  x',+U.>-+)Mo)  =  (I  -  P+)  lim  e-ix'JxT~ (V)\d{)).  (153) 

.r— ►oo  x->  oc  r— ►oc 


where  T“(A+)  is  the  lower  triangular  matrix  introduced  in  (17).  If  the  potential  is  on  finite 
support  or  is  reflectionless  then  T“(X)  is  rational  function  well  defined  for  k  =  A+.  If 
the  potential  is  generic  then  T“(X)  does  not  allow  analytic  continuation  off  the  real  axis. 
Nevertheless  T“(A+)  can  be  understood  as  lower  triangular  constant  matnx  (generalizing 
the  constant  C{+  of  the  NLS  case).  Being  lower  triangular  T“(A+)  maps  \do)  onto 
| d'{))  =  Tq  (\+)\do)  which  is  again  of  the  form  (149),  i.e.  its  first  three  components  vanish. 
Therefore 


lim  =  lim  (II  -  P+)evax 

x-*oo  *— »-oc 


/  °  \ 
0 
0 

L  "o,4 

\e-A*J>*n'0J 


=  0. 


for  any  constant  a  >  0  such  that  a  +  J4  <  0 

Likewise  we  can  calculate  the  limit  for  a  — ►  —00 


(154) 


lim  /+M  =  -  lim  x'+(* .kfilpo)  =  -(1  -  P+)  lim  e-,k'JxS+(X+)\pn). 

C—  -OO  JC— ►  —  OO  X-*-00 


(155) 


The  upper  triangular  matrix  S+(X+)  is  treated  analogously  as  T~(A.+).  In  the  generic  case  it  is 
just  an  upper  triangular  constant  matrix  which  maps  |/?o)  onto  \p't))  =  S+(k*)\p0)  whose  last 
two  components  vanish.  Therefore 


lim  evhKf+(x) 


lim  evhx ( I  -  P_) 


/e^X,\ 

J2X 


0.2 

^0.3 


=  o. 


for  any  constant  b  <  0  such  that  +  b  >  0. 
The  lemma  is  proved. 


(156) 


□ 


For  the  choices  (a)  and  (b)  of  \n0)  in  (125)  we  define  the  square  integrable  discrete 
eigenfunctions  using  the  splitting  (149)  and  (152). 
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Remark  4.  The  choice  (c)  for  |«o)  does  not  allow  for  the  splitting  (149).  In  this  case  we  can 
introduce  only  generalized  discrete  eigenfunctions,  /gcn(j c,  /),  which  are  not  square  integrable. 
But  upon  multiplying  by  the  exponential  factor  t~vcx  with  c  =  (J\  +  Ji )/2,  we  can  obtain 
square  integrable  functions  f(x)  =  /gcn(JC)e_vc't-  See  also  the  discussion  in  the  following 
subsection. 

The  generalized  eigenfunctions  come  up  in  situations  when  the  splitting  (149)  is  not 
possible,  i.e.  when  either  \p0)  or  \d0)  vanish  Let  us  construct  the  generalized  eigenfunction 
for  the  polarization  vector  |«o)  of  case  (c)  in  (142).  Let  ( J\  +  Ji  +  Jf)/ 3  =  a'\  then 
J[  =  Jx  —a\  J[  =  J2— a'  and  =  J$  —  a'  are  such  that  J[  >  J2  >  ^3  and  J{  +  J2  +  Jy  =  0. 
Let  us  assume  for  definiteness  that  J[  >  >  0  and  0  >  Jy  Then  we  can  split  |«o)  into 


l«o)  =  \p'o )  +  Mo). 

1  Po)  = 

Al0l\ 

mu 

0 

Mo)  = 

f0\ 

0 

"0.3 

0 

0 

Vo/  Vo/ 

and  define 


/+•'(*)  =  x+(x^+)\Po)  =  “X+(*^+)Ki>-  058) 

Obviously  f+,(x)  is  an  eigenfunction  of  the  dressed  operator  L  corresponding  to  the  eigenvalue 

X+. 

Then  we  can  prove  the  following  lemma. 

Lemma  2.  The  eigenfunction  f+  '(x)  is  such  that  eVia  x  f+,,(x)  decreases  exponentially  for 
both  x  — >  ±oc. 

Proof.  The  proof  is  similar  to  the  one  of  lemma  1  and  we  omit  it.  □ 

Since  the  polarization  vector  |z?o)  in  case  (c)  of  (142)  does  not  allow  the  splitting  (149) 
the  corresponding  discrete  eigenfunction  will  not  be  square  integrable,  so  it  will  give  rise  to  a 
generalized  eigenfunction. 

5.  Effects  of  reductions  on  soiiton  solutions 

In  this  section,  we  analyze  how  different  kinds  of  reductions  affect  the  classification  of  the 
soiiton  solutions  to  a  nonlinear  equation.  This  criterion  is  tightly  connected  with  symmetries 
imposed  on  the  auxiliary  linear  problem  (the  zero  curvature  condition).  We  shall  consider  in  the 
next  subsection  types  of  solitons  which  differ  from  one  another  in  the  number  of  eigenvalues 
associated  with  them  doublet  solitons  associated  with  two  purely  imaginary  eigenvalues 
X*  =  iiu  and  quadruplet  solitons  associated  with  four  eigenvalues  situated  symmetrically 
with  respect  to  the  real  and  the  imaginary  axis  in  C.  This  is  the  case  when  a  H2  x  H2  reduction 
is  in  action.  Such  a  type  of  reduction  is  compatible  with  the  Lax  representation  of  a  NLEE  to 
have  a  dispersion  law  obeying  /(—X)  =  -/ (A).  The  ;V-wave  equation  is  a  typical  example 
of  NLEE  to  admit  a  variety  of  Z2  reductions  [27]  since  fu- wM  =  —A./. 

It  is  also  possible  to  impose  reductions  that  do  not  affect  the  spectral  parameter.  Thus, 
imposing  such  reduction  on  the  generalized  ZSS  related  to  the  extenor  automorphism  of  sl(n) 
we  can  effectively  reduce  it  to  the  generalized  ZSS  system  for  its  subalgebras  so(n)  and  to  the 
symmetric  space  of  BD  I  type. 
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For  further  convenience  we  shall  say  that  the  matrix  X  belongs  to  the  orthogonal  algebra 
jo(rt)  if 


* + = o. 

where  S(nn)  is  defined  by 


r.(n)  ^.(n) 


>0 


_ 

^0  — 


(159) 


n+ 1 

sT  =  £(-')I+lCV,.  (160) 

j=1 

for  n  =  2r  +  1  and 

C  =  E(-l)J+'(C+l-,  +  Cl-rJ  HOI ) 

I 

for  n  =  2 r.  With  this  definition  of  orthogonality  the  Cartan  subalgebra  generators  are 
represented  by  diagonal  matrices.  By  above  we  mean  n  x  n  matrix  whose  matrix 
elements  are  (£<*>),.  =  Ssi8pj. 

In  order  to  get  the  ZSS  related  to  the  symmetric  space  of  BD  I  type  we  have  to  specify 
7  =  diag(l,  0, . . . ,  0,  —  1)  and  take  q(x,  t)  =  [7,  Q(x ,  /)]  where  Q(x ,  /)  e  so(n).  In  this 
case  the  NLEE  with  linear  dispersion  law  become  trivial.  However,  one  can  consider  NLEE 
with  quadratic  and  cubic  dispersion  laws  which  are  multicomponent  generalizations  of  NLS 
and  mKdV  equations  respectively. 


5. 1.  N-wave  system  related  to  so{ 5) 

From  now  on  we  shall  focus  our  attention  on  a  N- wave  equation  related  to  the  jo(5)  algebra. 
The  corresponding  Lax  pair  is  given  by  (3),  (4)  where 

f/U,  /,  X)  =  [  J ,  0(jc,  /)]  -  XJ,  F(jc,  /,  X)  =  [/,  G(jc,  /)]  -  XL 

(162) 

J  =  diag(7),  72, 0,  -72,  —7,),  /  =  diag(/,,  /2, 0, -/2,  -/i), 

and  Q(x ,  /)  is  a  function  taking  values  in  so(5).  This  algebra  has  two  simple  roots 
a\  =  e\—e2,OL2  —  ?2,and  two  more  positive  roots:  or  i  -hof2  =  e\  andori+2or2  =  ei+^2  =  amax. 
When  they  come  as  indices,  e.g.  in  Qa  we  will  replace  them  by  sequences  of  two  integers: 
a  — ►  kn  if  a  =  ka\  +  na2 .  Moreover,  we  are  going  to  use  the  auxiliary  notation 
kn  =  —ka \  —  fl(*2.  Thus  the  /V-wave  system  itself  consists  of  eight  equations.  Half  of 
them  read 

•(•A  -  Ji)QwAx ■  0  ~  i(/i  -  h)Qio.x(,x,t)  +kQn(x,  OGgyU.  t)  =  0. 

•>/l  01  uC*.  0  ~  'A  011„t(X'  0  —  *(Gio0oi  —  0i20or)(^.  t)  =  0, 

(163) 

i(7i  +  Ji)Q\2.t(x<  t)  -  i(/i  +  h)QnAx*  O-kQ u(jf,  t)Qox(x.t)  =  0, 
iJlQoiAx**)  -  ^2Goi„rU,/)+/:(0TT0i2  +  0TO01l)(Jf’O  =  0, 

where  k  :=  J\  /2  —  72/|  is  a  constant  describing  the  wave  interaction  The  other  four  equations 
can  be  derived  from  those  above  by  using  the  formal  transformation  Qkn  «-►  Q-k One  is  able 


24 


J.  Phys.  A:  Math.Theor.  41  (2008)  315213 


V  S  Gerdjikov  et  a! 


to  integrate  the  system  by  applying  the  already  discussed  ideas — dressing  method,  etc.  For 
that  purpose  we  make  use  of  the  dressing  factor  (38).  The  one-soliton  solution  reads 

Qio(x.t)  =  X  -X~- (e~'a'z,~XZi)n0.  i  m0,2  +  e'a’Zi~Xz>  'no.mo.s). 

( m\n } 

Qu(x,i)=  -  -  -  (e  ,x'l'no,imo.3 -e~a  z'n0jm0,s), 

(m\n) 

Q\2(x,t)  =  ^y-(e~,a*Zl+x  Zl)»o,iw*o.4  +  e-,(A  l'+x'Z2)n0.2mo.s).  (]f4) 

X"  —  ♦ 

GoiU.  0  =  T  ,  .  (e~'x>Z2H0.2Wo,3  +  e~'x  Zjno.3«o,4). 

(m\n) 

5 

{m\n)  =  e-,(X*-A  z*  =  7**  +  /*r.  &  =  I . 2. 

*=i 

The  other  four  field  can  be  formally  constructed  by  doing  the  following  transformation: 

Qkn  <->  e-,rz‘  +*  e'^2*.  n0j  **  m0j. 

For  the  typical  Z2  reduction  ( 1 20)  of  course  we  must  choose  K{)  =  diag(ej  ,  e 2 ,  1 ,  £2^1 )  € 
50(5)  where  6,  =  ±1 .  Asa  result  7  and  /  become  real  valued  and  KqQ^  Kq1  =  —  (?,  i.e., 

Qlo  =  0t2  =  — 6l620l2»  (?TT=“€l(?n<  Qoi=-^2fioi. 

(165) 


The  corresponding  4-wave  system  takes  the  form 

i(*/i  -  J2)Q\oAx*0  -i(/i  =0, 

i*/i  Gn,fC*\  0  —  i^i  0ii,*C**  0  “  £(2io2oi  +  £iQ\2Qq\)(x'  0=0, 

(loo) 

\{J\  +  Ji)QnAx,  t)  -  i (/,  +  /2)0i2.,U,  0  -  *Gn(*,  OQoiix,  t )  =  0, 

\JlQoxAXs  t)  —  \hQo\,x(x%  t)  —  k€\(Q\{  Qn  +€2G|0Gh)(x,  t)  =  0. 

The  particular  case  €\  =  €2  =  1  occurs  in  Raman  scattering  [28]. 

The  corresponding  one-soliton  solution  is  obtained  from  (164)  imposing  X+  =  (X~)*  = 
fi  +  \v  and  |mo)  =  Agoing).  Here  we  just  note  that 

(m|n)  =  €  1  (e”2l,Zl  |/*o.i  1 2  +  e2l,Zl  |«o.5 12)  +  f2(e“2l,Zj  |no.2l2  +  e2vZl  |«o.4 12)  +  |«n.3l2.  (167) 

Taking  €\  =  €2  =  1  we  find  that  (m\n)  is  positive  for  all  x  and  t  If  €\ €2  =  —1  the  product 
( m\n )  may  vanish  for  finite  jc  and  /,  i.e.  the  corresponding  soliton  is  singular. 


5.2.  Z2  reduction  related  to  Weyl  group  elements 

Here  we  consider  several  Z2  reductions  related  to  Weyl  group  elements.  The  first  one  is  of  the 
type 

K\ U*(X*)K;'  =  -t/(X)  =>  KiQ'Kj'  =  Q,  KXJ*K~'  =  -J,  (168) 

where  K\  corresponds  to  the  Weyl  reflection 


Ki  = 


/0 

-1 

0 

0 

V  o 


-1  0  0  0\ 
0  0  0  0 
0  10  0 
0  0  0  1 
0  0  10/ 


(169) 
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This  reduction  leads  to  the  requirements  J2  =  —  JJ,  h  —  — /*  for  J  and  I  respectively,  and 
the  following  ones  for  0 , 

<2To  =  e;0.  <2o.  =  -<2T,.  Q'n  =  -Qr-  Qm  =  -Qh-  <%  =  -< 2n. 

The  Z2  reduced  integrable  system  consists  of  the  following  five  equations  for  three 
complex  gio,  0n,  0yy  and  two  real  fields  Q \2  =  iqi2,  Qjj  =  ^72: 

27io0io.r(**  f)  —  2/|,o(2io.jc(^.  0  —  koQu(x,  OGjfC**  0  =  0, 

J\Qnj(x<  t)  —  I\Qi\.x(x>0+ko(Q\QQ*n  —  iqi2  0yy)U.  0=0, 

2ii.iqi2.,(jr,l)  -  2/|.iqi2,.,(x,/)  -£oK?n(*.  ')l2  =  0.  (170) 

J\  Qfij (-*  -  0  h  Qu.x (-*»  0  +  ko(  01O0JJ  —  iqj2^ 1 1  )(■***  0  =  0, 

27,.,  Qti,,(x.  i)  -  2/|,i  Qf zx(x,  i)  -  ')|2  =  0, 

where  J \  =  Jqj  +  iJ|j,  I\  =  Iqj  +  i/|j  and  the  interaction  constant  ko  =  — i k  is  real. 

Next  we  consider  a  Z2  reduction  of  the  type 

J.  K2Q'K?  =  -Q, 


KiUHWKZ1  =  U(k) 

=> 

k2j*k 

with  K  =  Sf2 ,  i.e., 

n 

0 

0 

0  0\ 

0 

0 

0 

1  0 

k2  = 

0 

0 

-1 

0  0 

0 

1 

0 

0  0 

\0 

0 

0 

0  1/ 

C,0  —  —Q 12' 
Qh  =  2o- 


Qn  =  -Ql 


(171) 


Therefore  we  have  the  following  relations: 

j;  =  Ju  j;  =  -h. 

<2tt=0m»  0oi  =  0oi< 

Thus  we  derive  the  following  5-wave  system: 

U\  -  J2)Q\oAx*0  ~  Ui  -  h)Quu(xJ)  +  k)Q\\(x*t)Qm(x>  0  =  o, 

J\  Q\\j(x*  0  —  UQ  H.jrUi  0  —  ^o(  010  001  “  Q\2Qo\)(x*  0=0, 

(Ji  +  Ji)Q\2A*>0  -  (A  +  h)Q\2.xU<  0  -*o0n(*<O0oi(-*.O  =0,  (172) 

^2  0Ol.rU*  0  —  ^2  0Ol.x(  v,  0  +  ^o(0|i  012  —  012  011  )(*<  0=0, 

JiQcftjix* 0  “  ^2 0oT.x (-* » r)  ”  M 01 1  0m  “  0io0i  i  0  =  0, 
where  £o  =  —  i£  is  real. 

The  last  two  reductions  requested  complex  valued  J  and  /.  As  a  result  the  direct  and 
inverse  spectral  problems  for  the  corresponding  Lax  operator  L  become  more  complicated 
[26].  In  particular,  the  continuous  spectrum  of  L  fills  up  a  bunch  of  lines  in  the  complex 
A-plane  intersecting  at  the  origin.  The  construction  of  the  corresponding  fundamental  analytic 
solutions  and  the  dressing  factors  requires  additional  care  and  will  be  discussed  elsewhere. 

5.3.  Reductions  of  MNLS-type  systems  related  to  $o(5) 

MNLS  equations  related  to  so(n  +  2)  algebras  have  a  Lax  representation  of  the  following  type. 

L  =  \dx+q{x,t)-\J,  (173) 


M  =  id,  +  Vq(jc,  0  +  Vi(.v,  t) X  —  X2J. 


(174) 
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Vi(x,t)  =  q(x,t),  VoC*.  0  =  iad  jdxq  +  j[ad  jq,  q).  (175) 

Here  J  is  the  element  of  the  Cartan  suhalgebra  of  so(/i)  dual  to  €\  and 
/  0  qr  0  \ 

q  :=  p  0  i„q  ,  J  :=//,=  diagd.0 0,-1),  (176) 

V  0  p7^  0  / 

where 

q  =  (<?2 . qn*\)T  •  P  =  (P2 . Pn*\)T  ■ 

Because  of  the  specific  choice  of  J  these  MNLS  equations  can  be  viewed  as  connected 
with  the  BD.I  symmetric  space.  In  most  of  this  section  we  will  use  the  simplest  case  with 
n  =  3,  though  all  results  can  be  easily  generalized  for  any  rank  of  the  algebra  The  MNLS 


system  itself  takes  the  form 

'q2.t  +  q2.xx  +  2q$p2  +  <liP*  +  2$3<?2P3  =  0.  (177) 

i<?3,/  +  *?3.jcjr  +  2q2q4pi+2q2p2qi+2q3qApA+qlp)  =  0.  (178) 

+  qx.xx  +  2qi  P4  +  qjp2  +  ZqwP)  =  o.  ( 1 79) 

ip2.t  -  P2.XX  -  2p\q2  -  p]q 4  -  2pip2qi  =  0.  (180) 

ipi.t  -  Pi.xx  -  2p2pAqy  -  2p2q2Pi  ~  2pAqApy  -  p]qy  =  0,  (181 ) 

ip4,<  -  P4.xx  ~  2pjqA  -  pjq2  -  2 pspAqs  =  0.  (182) 

Its  one-soliton  solution  derived  via  dressing  procedure  reads 

Qk  = - - - (e  lk  <t+x  ^nQA/nO'k  +  (—  0* e  lA  U+A  r)/to.6-*wo.5).  (183) 

A 

Pk  =  - — r— (e'A  <Jr+X  'W*">o.i  +  (-1  )*  e'r<Jr+r,)no.5mo.6-*).  (184) 

A 

where 

4 


a  —A.*  )(.*+( A.  +X*)f)  ,  \  „  ,  A.  )(t+(X  +A.*)/) 

^  —  e  no.iWoj  +  ^/io.aWo,*  +  e  wo.s^o.s* 

2 

Consider  a  Z2  reduction  of  the  form 

KUHx.k*)K-'  =  U(x,k),  i.e,  Kq]K~'=q.  KJ*K~'  =  J.  (185) 

If  J  is  real  there  are  two  inequivalent  choices  for  K  satisfying  K  J K~l  =  J:  the  first  one 
is  the  typical  reduction  via  an  element  of  the  Cartan  subgroup,  K \  =  diag(ei,  € 2 ,  I,  €2,61), 
ef  2  —  U  the  second  one  is  a  reduction  via  a  Weyl  reflection  Se2 ,  see  (171).  The  reductions 
obtained  with  K\  and  K2  respectively  give 


P2=€\€2q^  P)=€  1^3,  P4=€\€2q4\  (186) 

P2  =  qh  P^  =  -<73  ’  P4=ql  (187) 

The  former  X2  reduction  leads  to  the  following  three-component  system  of  NLS  equation: 

'q2.i  +q2.xx  +2(i(e2\q2\2  +  \q-i\2)q2+(l(2qjqA  =0,  (188) 
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iqit+qxxx  +  261<72<74<73  +  €i(2€2|<72|2  +2e2|<74|2  +  \qi\2)q*=0,  (189) 

i<74,r  +  <74,jct  +  i  (^2 1<74 1 "  +  l<?3|2)<74  +  ^i^2<73<72  =  0*  (190) 


In  order  to  integrate  this  system  we  apply  the  dressing  procedure  with  the  dressing  factor  (38). 
Taking  into  account  that  in  the  reduced  case  we  have  the  relations 

A+  =  (X~)*  =  m  +  iv,  |m0)  =  ^i|wj),  (191) 

we  find  that  its  soliton  solution  is  given  by 


~  V  -iLt(x-vt) _v(jt— uf)M  „♦  ,  -  -v(.r-uOM  *  \ 

<72  =  — “ —  C  (^2C  ^0.1«o,2  +  ^le  Wo,4«o.5/’ 

<73  =  ^e-i'iU-w)(e',<t-"'V1^,3  e-’,j:-“'»no.3«S.$)- 

q. ,  =  il!e-^'-''l(«2e'',-',ti„,ln(;4+(1e-’"-“"»„.!»;,s). 

A  —  €[  e2  u_“'>|no,il2  +€2(l«0.2|2  +  l"0.4|2)  +  l»0.3l2  +€|  c  “'Wr. 


At 

The  latter  reduction  gives  rise  to  another  inequivalent  system  of  three  NLS  equations 


(192) 

(193) 

(194) 

(195) 

(196) 


W2.f  +  <?2..rjr  +  2(<72<7X  -  kl|2)<72  +<?3<?2  =  0.  (197) 

+  q\xx  ~  24:444}  +  (24:44*  +  2q*qi  -  l43l2>43  =  o.  (198) 

i<74.r  +  q*.xx  +  2(4442  -  I4jI2)44  +  4.1 44*  =  0-  (199) 


Then  we  have  the  following  one-soliton  solution’ 


<72  =  +  c-v<'-“'*no.4ti(*,5).  (200) 

43  =  ^  c-*MUr-«»(e*Ur-«»Wain^3  +  e-^-“'>n0>3„*  j).  (201  ) 

<74  =  ^  e-iM(J!-  w  ’  (ev<j:_*')no.  i  «o,2  +  e-^-'VinJs).  (202) 

A  =  e2v(r— "r) |/?0. 1 12  +  (wo.2^o.4  +  rto.2^o.4)  —  l'i(ul2  +  e  2vU  w0|wo.st2-  (203) 

As  in  the  previous  example  the  polarization  vectors  are  interrelated  via  (191)  which  in  this 
case  reads 


m0,k 


l(-l)*n 


0.6-*' 


k  =  1,5 
k  =  2,3,4. 


Next  we  consider  a  Z2  x  Z2  reduction,  which  is  a  combination  of  reductions  with  K\  and 
K 2.  This  is  possible  only  for  €1  =  —  1 .  Then 


PlA  =  ^2^2.4  ’  <72  =  “*2<74<  />3  =  (204) 

and  we  obtain  the  following  two-component  NLS  system: 

i<?2.f  +  qz.xx  -  2(^2 1<72 12  +  l<73l2)<72  +  4242*  =  (205) 

•43.f  +  qxxt  -  (4€2l<72l*  +  l43l2)43  +  2f2(42)243*  =  °-  (206) 


28 


J  Phys.  A.  Math.  Theor.  41  (2008)  315213 


V  S  Gerdjikov  et  at 


Its  one-soliton  solution  takes  the  form 

<73  =  ^  e-^<x-”,(e‘’(x-“',n0.i<3  +e-’,Jt-“',/i0.3<5). 

A  =  e2u(x-"'>|/i0  l|2  -  2e2|«o.2|2  -  l«o.3l2  +e^2''<x‘“')|/'o.5l2. 

v2-p2  „ 

v  = - ,  u  =  -2/x, 

M 


where  we  have  made  use  of  the  following  relations: 


"to.*  = 


"o.*- 

“€2«S.* 

-”5,3’ 


n0.6 -A’ 


A:  =  1,5 
A:  =  2,  4, 
A:  =  3. 


(207) 

(208) 

(209) 

(210) 


5.4.  MMKdV  equations  on  symmetric  spaces  of  BD.l  type 

Multicomponent  MKdV  related  to  so(n  +  2)  algebra  possesses  a  Lax  pair  with  the  L  operator 
(173)  and  M-operator  in  the  form 

M\j/(x,t,  X)  =  \dt\j/  +  ( Vi)(jr,  /)  +  X Vi(x%  t)  +  X2  Vi{x%  t)  —  aV)^(a‘,  /,  X), 

V2(x,t)  =  q(x.t).  Vt(x,t)  =  iddj'dxq  +  jjadjV  <?(*.  0].  (211) 

Vo(x,t )  =  —  92,<7  +  5 [adj 'q.  [adjV  q(x,  /)]]  +  i[9t<7.  <7]- 

The  corresponding  MMKdV  equations  can  be  written  down  in  compact  form  as 

3,q  +(>lxxq  +  3(p,q)dxq  +  3(9, <7,  p)q  -3(dxqs0q)sop  =  0.  (212) 

3 tP  +  &xxx P  +  3(i>.  <7)3,  P  +  3(3 xp.  q)p  -  3  (3 xps0p)  s()q  =  0.  (213) 


Consider  a  Z2  reduction  of  the  type 

U'(n  =  UW,  =>  p  =  q\ 


(214) 


Then  we  obtain  the  reduced  system  of  equations 

3, <7  +9xxxP  +3|<7l23.*<7  +3(3 tq.  q*)q  -3(dxqsoq)soq*  =  0. 
Its  one-soliton  solution  reads 
—iu  Q-'^U-ut-So) 


4k 


cosh(2u(.x  —  vt  —  ?o))  +  C 


(cH»-w-fc)c.  +(_!)*  e-»<x-w-fe)£.n+ J_t)f 


(215) 


ftp.* 

v/i^.in^i’ 


A:  =  2 . n  +  1 


71+1 


C  =  E 

k=2 


M2 

2i«o.iiiwo.n+2r 


(216) 


v  —  v2  —  3/a2,  u  =  3u2  —  /a2. 


c  arg«o.i 

d0  =  - 

P 


2y 


kh),n+2 1 
1*0.1 1 


provided  we  have  hxed  argrt0,i  =  —  argM0>n+2  by  using  the  natural  U{  1)  symmetry  of  the 
solution. 

As  a  final  example  let  us  consider  MKdV  related  to  s<?(5)  with  a  Z2  reduction  of  the  type 
=  U(X)  =>  Kq'K-'  =  q,  KJK~'=-J.  (217) 
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We  choose  K  =  K0  o  We  where 


(° 

0 

0 

0 

f  0 

0 

0 

0 

-€l\ 

0 

1 

0 

0 

0 

0 

€2 

0 

0 

0 

= 

0 

0 

-1 

0 

0 

=>  K  = 

0 

0 

-1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

€2 

0 

(218) 


is  the  Weyl  reflection  with  respect  to  the  hyperplane  orthogonal  to  e\.  The  following 
interrelations  hold  true: 


<74  = -eK2<72’  <73  = -<T<7v  />4  =  -*i*2P2’  P)  =  -*\Pi-  (219) 

As  a  consequence  of  the  reduction  we  have 

k+  =  -(k~)\  \m)  =  K\n)\  (220) 

or 

a*)*  =  —A.*,  |n)  -SK\n)\  (m\  =  (m\*(SK)-'.  (221) 

Applying  another  Z2  reduction  of  the  type 

uT(-x)  =  -U(X),  =>  qT  =  -q .  (222) 

we  obtain  that 

r  =  |m)  =  |«).  (223) 

The  corresponding  system  of  MKdV  is 

qi.,  +  qi.xxx  -  3(<?2<73)*<73  +  3e  1<F2<73<72 <73. 1  -  bq\qi.x  =  0,  (224) 

<73.(  +  93, «jt  +  3£if2l<72l?<73  -  3(9293)<<?2  -  -  3^3..t  =  0.  (225) 


and  is  new  to  the  best  of  our  knowledge  Combining  both  reductions  we  again  may  have 
two  types  of  solitons.  The  doublet  soliton  corresponds  to  purely  imaginary  X±  =  ±iu  and 
|h)  =  SK  |/t)*  and  is  given  by 


pi<5o 


<72  = 


€ i  cosh  2u(.r  —  vt  —  to)  +  C 


D)C2+e-m-W^)C4)i 


(226) 


<73  = 


2iuc3  e,5°  sinh  u(.v  —  vt  —  f0) 
cosh  2u(.v  —  vt  —  fo)  +  C 
Itt 


(So  =  argNo,i  =  arg«0,5  = 


l  €  : 


2€2  Re(/i0.2»o.4)  +  l^tul2 


(227) 


2|/to.  1  1 1«0.5  I 

Cj  =  -61C[,  C\  =  -62C4,  C3  =  -C3,  C5  =  -6iC5, 

where  c*,  u  and  £o  coincide  with  those  in  the  previous  example  when  r  =  2.  From  (219)  it 
follows  that  <73  is  either  real  or  purely  imaginary  valued  function. 

Yet  another  possibility  to  ensure  compatibility  of  both  Z2  reductions  is  to  modify  the 
dressing  factor  into 


wtv,  t,k)  =  I  + 


A(x,t )  KSA*(Xi  t)SK 


SA{x,t)S  KA*(x,t)K 
+ 


k  +  k() 


k-K 


k-k() 

which  leads  to  the  quadruplet  soliton  solution 

<7 tv,  f)  =  [J%  A  —  KSA'SK  -  SAS  +  KA*K] tv,  t). 


(228) 


(229) 
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In  order  to  find  it  we  have  to  calculate  the  matrix  A{ a,  /).  For  that  purpose  it  proves  to  he 
convenient  to  decompose  A  into  A  =  X FT ,  where  X  and  F  are  rectangular  matrices  of  rank 
5  ^  r  and  Xo  =  p  +  iv.  It  can  be  checked  that  F(a,  t)  —  e‘(XoJr+x®,),/ Fo.  Fq  =  const.  In  the 
simplest  5  =  1  case  for  the  factor  X  one  can  obtain  the  following: 
a*F  +  bSKF*  -  cK F* 


X  = 


\a\2  +b2  -  c2 


where 

FtF 


a  = 


\F0.M 


b  = 


2X0 

(F'SKF) 

2\v 

(F'KF)  _ 

2/i 


il^O.1^0.51 


(cosh  2(0r  -  i(p\)  +  Ca). 
(cosh  20r  +  Cb ), 


\FoaFos\ 


(cos  20i  +  Cr), 


Ca  = 

ch  = 

Cc  = 

<t>\ 


Fh  j  Fh  +  Fh 

21^0.1^0.51  ' 

2Re(Fn*2Fo,4)  +  |/ro,3l2 
2|F0.,F0.j| 

|Fo.2l2-|Fo.3l2  +  |Fo,4|2 

2|F0.,F„.5| 


/  #  arg  ^0.5  \ 

=  /T'“ - — j 


.  (  1  .  Ifiul\ 

0R  =  y^-wr-  — ln— J. 

arg  F0.i  =  -arg  F0.5. 

Substituting  this  result  into  (220)  and  choosing  €\  =  62  =  1  one  derives 

2v/|/r0,l  ^0,2  ^0,4  ^0,5 ' 


<n  = 


</3  = 


a*  cosh(0R  —  i0j  )  —  b( cosh  (0R  +  i0j+)  +  cosh  (0R  —  i0,  )) 

(230) 

\m{(b  +  c)  sinh(0R  +  i(p\)  —  a*  sinh(0R  -  i0i)}Fo.3,  (231) 


\a\ 2  +  b2  —c2 

—  a  cosh  (0R  +  i0*)  +  c(  cosh  (0R  +  i0,“)  —  cosh  (0R  —  i^*))} , 
2i>/|Fo.,F0.s 


\a\2  +  b2  —  c2 
where  we  have  used  the  following  notation 
,±  .  ,  1  .  |Fo,2| 

0d  —  0r  i  —  In - , 

R  w  2  if0.4i 


4>t  -  0i  ±  arg  F0,4. 


(232) 


6.  Discussion  and  further  studies 

Here  we  shall  outline  some  further  topics  which  could  be  studied  and  which  could  lead  to  a 
deeper  understanding  of  these  soliton  properties. 

The  first  obvious  remark  is  that  sl(n)  contains  as  subalgebras  also  so(p )  and  sp{p) 
subalgebras.  So  it  will  be  interesting  to  specify  the  conditions  under  which  L(k)  has  solitons 
of  type  so(p)  or  sp(p). 

The  second  remark  of  the  same  nature  is  that  one  can  start  with  L(X)  related  to  any  simple 
Lie  algebra  g  (e.g.,  so(rt)  or  sp(n)).  The  analysis  of  soliton  solutions  for  NLEE  related  to 
such  systems  require  deeper  knowledge  of  Lie  algebras  and  their  representations. 

The  explicit  form  of  the  corresponding  /V-wave  system  related  to  these  algebras  has  been 
reported  in  [4,  5,  13],  see  also  [29,  30]  What  could  be  done  is  to  analyze  the  structure  of 
its  soliton  solutions  [29,  30]  which  are  more  involved,  especially  in  the  case  when  additional 
reductions  are  imposed.  The  construction  of  the  corresponding  dressing  factors  is  more 
complicated,  but  one  can  expect  that  new  interesting  types  of  integrable  cubic  and  quartic 
interactions  could  be  obtained. 

Another  important  question  is  the  study  of  solitons  constructed  by  projectors  of  rank  2 
and  higher.  Such  solitons  have  been  already  used  by  Wadati  et  al  [31]  in  describing  BEC 
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with  hyperfine  structure  F  =  1 .  Now  each  soliton  will  be  parameterized  by  two  polarization 
vectors;  the  corresponding  eigensubspace  will  be  two  dimensional  too.  Among  the  various 
types  of  rank-2  one-soliton  solutions,  there  will  be  various  possible  configurations  for  the  two 
polarization  vectors.  An  example  of  a  dressing  factor  w(jc,  f,  X)  constructed  by  a  projector  of 
second  rank  is  the  following  one: 


«(jc,  /,  X) 


Such  type  of  a  dressing  factor  was  used  in  [32]  to  derive  the  soliton  solutions  to  a 
multicomponent  Schrodinger  equation  relate  to  symplectic  algebra  sp{ 4). 

It  is  known  in  general  how  the  machinery,  well  understood  for  the  AKNS  system  such 
as  Wronskian  relations,  expansions  over  ‘squared  solutions’,  etc  can  be  generalized  also  for 
these  types  of  systems.  The  dressing  method,  after  some  modifications,  can  also  be  applied, 
leading  to  the  derivation  of  their  soliton  solutions. 

An  interesting  problem  is  the  study  of  how  the  different  possible  reductions  (see,  e.g., 
[29])  of  these  systems  will  influence  the  number  of  one-soliton  types. 

Soliton  interactions  for  various  different  types  of  solitons  of  these  systems  also  present 
interesting  problems.  From  the  results  known  for  the  N- wave  systems  [9,  10]  it  is  known  that 
new  effects  in  soliton  interaction,  such  as  soliton  decay  and  soliton  fusion  may  arise. 
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Appendix  A.  The  orthogonal  algebras  and  BD.I  symmetric  spaces 

The  definition  of  orthogonality  used  in  equations  ( 1 59}~(  161)  has  the  advantage  that  the  Cartan 
subalgebras  can  be  represented  by  diagonal  matrices. 

It  is  well  known  that  the  two  cases  of  odd  and  even  n  are  substantially  different.  The 
algebras  so(2r  +  1)  are  known  as  the  Br  series  according  to  the  Cartan  classification  [33]. 
Assuming  that  the  reader  is  familiar  with  the  basics  of  simple  Lie  algebras  here  we  just 
recall  that  the  root  systems  of  these  algebras  consist  of  A"  U  A+  where  A-  =  —  A+  and 
A+  =  {ej  ±  ejyej]  with  l  ^  /  <  j  ^  r.  We  provide  also  the  Cartan-Weyl  basis  of  these 
algebras  in  the  typical  (2r  +  1  )-dimensional  representation 

Eei.ej  =  Eu  -  S()EjiS()  =  Eij  -  (-1  )i+JE]-, 

Egi+tj  =  Ei]  -  SoEjiSo  =  Eij  -  (-1  )i+jEj:h 

(A.  1) 

Ee i  =  Ej  r+\  —  S{)Er+\ jSq  =  £/,r+i  —  (~1)  Er+\ j, 

Hrj  =  Ejj  SoEjjSo  —  Ejj  —  Ej  j.  E-a  =  Ea  . 

Each  symmetric  space  is  obtained  by  applying  a  Cartan  involution  This  involution 
splits  the  group  SO (2r  +  1)  into  a  subgroup  SO (2r  —  1)  (8)  50(2)  and  a  factor  space 
50(2r  +  l)/50(2r  —  1)  8)  50(2).  Effectively  the  system  of  positive  roots  is  split  into 
A+  =  Aj  U  Af ,  where  the  subsets  of  roots  are  defined  as  follows. 

Aq  =  {ei  ±  eh  *>/,  2  ^icy^r),  A\={ex±ej ,  ex%  2^j^r).  (A.2) 

In  fact  Aq  contains  all  positive  roots  that  are  orthogonal  to  eu  while  A|  contains  all  positive 
roots  that  have  scalar  product  equal  to  1  with  ex. 
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Similarly  one  can  consider  the  algebras  so(2r)  which  are  known  as  the  Dr  series  Their 
Cartan-Weyl  basis  in  the  typical  2r-dimensional  representation  is  given  by 

Efi-ej  =  Eij  -  SoEjiSo  =  Ejj  -  (-1  )i+jEhh 

E<i+fJ  =  Ejj  -  SqEj;So  =  Egj  -  (- 1 ),+;  Ejj%  (A. 3) 

Hfj  =  Ejj  —  SoEjjSo  =  Ejj  —  Ejj.  E-a  =  Ea. 

The  corresponding  symmetric  space  is  SO (2r  +  2)/SO(2r)  0  50(2).  The  system  of 
positive  roots  is  split  into  A+  =  A{*  U  Af,  where 

A  l  =  {ei±ej.  2^i<j^r),  A|  =  {c,±^,  2  <  j  <  r)  (A.4) 

Again  Aq  contains  all  positive  roots  that  are  orthogonal  to  e\%  while  Aj  contains  all  positive 
roots  that  have  scalar  product  equal  to  1  with  e\. 

Appendix  B.  Higher  spin  representations  of  sl( 2) 

Here  we  construct  the  5/(2)  dressing  factor  (130)  for  higher  spin  representation  of  5/(2).  This 
we  do  by  using  completely  symmetric  tensor  powers  of  tt. 

Using  the  way  how  u  acts  on  basic  vectors  ex ,  i  =  1 , 2  in  C2 

ue  1  =  u\\e\  +  u2\e2*  ue2  =  u[2e\  +u22e2.  (B.l) 

The  normalized  basis  in  the  completely  symmetrized  tensor  product  of  C2  O  C2  is  given  by 

€i=e\®e\%  €2  -  -j^(e2  ®e\  +e\  ®e2),  €i  =  e2®e2.  (B  2) 

Since  u(x ,  /,  X)  belongs  to  the  group  SL( 2)  it  must  act  on  the  basic  elements  as  follows*. 


U0)(x,  \)(ej  ®ej)  =  (uej)  0  ( uej ). 


(B.3) 


Thus  we  obtain  the  following  representation  for  the  dressing  factor  for  spin-1  representation 


U0)  =  u  O  u  = 


/  U2u  sf2li  1 1 M 12  u] 2  \ 

y/2U\\U2\  Ml,W22+M|2M2I  V2u22U\2 

t\l  V2u22U2\  u\2 


V 


(B,4) 


Now  we  have  to  insert  the  expressions  for  utj  in  terms  of  c\ ,  tik  and  tn *  and  thus  we  derive 


U 


(3) 


U  o  U  =  Ci7rf3)  +  7Tq3)  +  —  71 
C 1 


where  the  projectors  na.  a  =  -1,0,  1  are  all  rank- 1  projectors  of  the  form 

,3, 

= - 


_  (3) 


K’K®)' 

40)\mO)\ 


_  (3)  _ 
71  {)  ~ 


(3)1 


where  (A#,(3)| N{3))  =  (M™\ N™)  =  (M i3) | N{j i)  =  (mmi  +m2n2) 


KK 

(Of) 

(3)U;(3)\ 


(Mi3)|A/(3)\ 


(B.5) 


(B.6) 


0=  \yf7nx 


n  2 


N^)=\-V2m 


\tn2 


(jlmwi  \ 

n2m2  —  1  I 

-sflnjmy  / 


(Af J3> |  —  (m2,  V2m\m2 ,  m\),  {^-i\  =  {n2'  —  \/2/t|«2.  rtf), 

(A/q3)  =  (>/2n2m\,  n2rn2  —  n\tn\ ,  —V2n\m2). 


(B.7) 
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Similarly  the  normalized  basis  in  the  completely  symmetrized  tensor  products  of 
C2  o  C2  o  c2  is  given  by 

€\  =  €\  0  e\  0  *|,  €2—  -^(cj  0*1  ®  *2  +  *1  ®*2  ®*1  +  ^2  ®  ^I  ®*l)< 

63  =  ^-(^1  0  e2  0  e2  +  e2  0  0  ^2  +  ^2  ®  ^2  ®  *i)»  f 4  =  e2  0  e2  0  e2. 

Therefore  the  dressing  factor  obtains  the  form 


Ui4)  = 


/  u\x  V3u2nui2  V3uuu2l2 

V3u2uU2i  U2uU22  +2mMM12W21  W22«2I  +2wmMi2W22 
V3w  1 1  «!2«21  +2wiiM21«22  U  \  \  W22  +  2w  12«21  "22 

V  u\x  V3u22lu22 

It  can  be  decomposed  into 

f/(4)  =  n%2c~V2 

where 

,4)  _ 


\ 


\/3m2I«22 


\/3w^2W22 

\/3w 

„3 


12"  22 


*22 


+  tt(4)  r  I/24-jr(4)r,/24-7r(4)r3/2 
+  7T-l/2C  +  JZl/2C  +Jr3/2C 


a  =  —3/2,  —1/2,  1/2,  3/2. 


The  (co)  vectors  |/Va(4))  ((ma|)  are  given  by 


N 


(4)  ' 
-3/2, 


=  —\fitn\m\,  \Z3m2m]'  —m])T , 


(M%2  =  (n2'  —  >/3«2wg  \^>n2n].  -/13) 

|  A^i4)/2)  =  (V5zi,m2,  m2{n2m2  —  2n\tri\),  —  ln2m2),  y/2n2m\)T , 

I  =  (>/3«2mi,  H2(w2Wi2  ~  2nimj),  /1 1  (zi  1  zzj  1  -  2/i2tfi2)«  \/3rt2/W2) 
|/V$)  =  ( V3n2]m2 ,  — /i  1  (zi  j z/j  1  -  2n2m2),  n2(n2m2  -  2n\ ni\),  -y/2n\m\)7 
{M\A/2\  =  (V3n2m2^  m\(n]tn\  —  2n2m2),  m2(n2m2  -  2n\tn\),  — \/3/ii//i2) 
1^3/2)  —  (**]*  V3/?j/I2,  V3/1|A12,/I3)r, 

Cl 


(A/3^2 1  =  (wj,  \/3mJ«i2,  -yiniiWj,  mj). 


(B.8) 


(B.9) 


(B  10) 


(B  11) 
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Squared  eigenfunctions  are  quadratic  combinations  of  Jost  functions  and  adjoint 
Jost  functions  which  satisfy  the  linearized  equation  of  an  integrable  equation.  They 
are  needed  for  various  studies  related  to  integrable  equations,  such  as  the  develop¬ 
ment  of  its  soliton  perturbation  theory.  In  this  article,  squared  eigenfunctions  are 
derived  for  the  Sasa-Satsuma  equation  whose  spectral  operator  is  a  3  X  3  system, 
while  its  linearized  operator  is  a  2X2  system.  It  is  shown  that  these  squared 
eigenfunctions  are  sums  of  two  terms,  where  each  term  is  a  product  of  a  Jost 
function  and  an  adjoint  Jost  function.  The  procedure  of  this  derivation  consists  of 
two  steps:  First  is  to  calculate  the  variations  of  the  potentials  via  variations  of  the 
scattering  data  by  the  Riemann-Hilbert  method.  The  second  one  is  to  calculate  the 
variations  of  the  scattering  data  via  the  variations  of  the  potentials  through  elemen¬ 
tary  calculations.  While  this  procedure  has  been  used  before  on  other  integrable 
equations,  it  is  shown  here,  for  the  first  time,  that  for  a  general  integrable  equation, 
the  functions  appearing  in  these  variation  relations  are  precisely  the  squared  eigen¬ 
functions  and  adjoint  squared  eigenfunctions  satisfying,  respectively,  the  linearized 
equation  and  the  adjoint  linearized  equation  of  the  integrable  system.  This  proof 
clarifies  this  procedure  and  provides  a  unified  explanation  for  previous  results  of 
squared  eigenfunctions  on  individual  integrable  equations.  This  procedure  uses 
primarily  the  spectral  operator  of  the  Lax  pair.  Thus  two  equations  in  the  same 
integrable  hierarchy  will  share  the  same  squared  eigenfunctions  (except  for  a  time- 
dependent  factor).  In  the  Appendix,  the  squared  eigenfunctions  are  presented  for 
the  Manakov  equations  whose  spectral  operator  is  closely  related  to  that  of  the 
Sasa-Satsuma  equation.  ©  2009  American  Institute  of  Physics. 

[DOI:  10.1063/1.3075567] 


I.  INTRODUCTION 

Squared  eigenfunctions  are  quadratic  combinations  of  Jost  functions  and  adjoint  Jost  functions 
which  satisfy  the  linearized  equation  of  an  integrable  system.  This  name  was  derived  from  the  fact 
that,  for  many  familiar  integrable  equations  such  as  the  Korteweg-de  Vries  (KdV)  and  nonlinear 
Schrodinger  (NLS)  equations,  solutions  of  the  linearized  equations  are  squares  of  Jost  functions  of 
the  Lax  pairs.  Squared  eigenfunctions  are  intimately  related  to  the  integrable  equation  theory. 
For  instance,  they  are  eigenfunctions  of  the  recursion  operator  of  integrable  equations.8^  They 
also  appear  as  self-consistent  sources  of  integrable  equations.  A  more  important  application  of 
squared  eigenfunctions  is  in  the  direct  soliton  perturbation  theory,  where  squared  eigenfunctions 
and  their  closure  relation  play  a  fundamental  role.  4 

Squared  eigenfunctions  have  been  derived  for  a  number  of  integrable  equations  including  the 
KdV  hierarchy,  the  Ablowitz-Kaup-Newell-Segur  (AKNS)  hierarchy,  the  derivative  NLS  hierar¬ 
chy,  the  sine-Gordon  equation,  the  massive  Thirring  model,  the  Benjamin-Ono  equation,  the 
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matrix  NLS  equations,  and  the  Kadomtsev-Petviashvili  equation.4,6,9  18  Several  techniques 
have  been  used  in  those  derivations.  One  is  to  notice  the  commutability  relation  between  the 
linearized  operator  and  the  recursion  operator  of  the  integrable  equation.  Thus  squared 

eigenfunctions  are  simply  eigenfunctions  of  the  recursion  operator.  The  drawback  of  this  method 
is  that  one  has  to  first  derive  the  recursion  operator  of  the  integrable  equation  and  determine  its 
eigenfunctions,  which  can  be  highly  nontrivial  for  many  equations  (see  Ref.  20,  for  instance). 
Another  technique  is  to  first  calculate  variations  of  the  scattering  data  via  the  variations  of  the 
potentials,  then  use  the  Wronskian  relations  to  invert  this  relation  to  get  the  variations  of  the 
potentials  via  variations  of  the  scattering  data  This  Wronskian-relation  technique  involves  some 
ingenious  steps  and  could  run  into  difficulties  too  if  the  problem  is  sufficiently  complicated.4  The 
third  technique  is  related  to  the  second  one,  except  that  one  directly  calculates  the  variations  of  the 
potentials  via  variations  of  the  scattering  data  by  the  Riemann-Hilbert  method.4  This  third 
method  is  general  and  conceptually  simpler.  Regarding  the  second  and  third  methods,  however, 
one  question  which  was  never  clarified  is  why  the  functions  appearing  in  the  variation  relations  are 
indeed  squared  eigenfunctions  and  adjoint  squared  eigenfunctions  which  satisfy,  respectively,  the 
linearized  equation  and  adjoint  linearized  equation  of  an  integrable  system.  In  all  known  ex¬ 
amples,  this  was  always  found  to  be  true  by  direct  verifications.  But  whether  and  why  it  would 
remain  true  for  the  general  case  was  not  known. 

Recently,  we  were  interested  in  the  Sasa-Satsuma  equation  which  is  relevant  for  the  propa¬ 
gation  of  ultrashort  optical  pulses.  This  equation  is  integrable  Two  interesting  features  about 
this  equation  are  that  its  solitons  are  embedded  inside  the  continuous  spectrum  of  the  equation, 
and  their  shapes  can  be  double  humped  for  a  wide  range  of  soliton  parameters.  One  wonders 
how  these  double-humped  solitons  evolve  when  the  equation  is  perturbed — a  question  which  is 
interesting  and  significant  from  both  physical  and  mathematical  points  of  view.  This  question  can 
be  studied  by  a  soliton  perturbation  theory.  Due  to  the  embedded  nature  of  these  solitons,  external 
perturbations  will  generally  excite  continuous- wave  radiation  which  is  in  resonance  with  the 
soliton.  So  a  key  component  in  the  soliton  perturbation  theory  would  be  to  calculate  this 
continuous-wave  radiation,  for  which  squared  eigenfunctions  are  needed  (a  similar  situation  oc¬ 
curs  in  the  soliton  perturbation  theory  for  the  Hirota  equation  ).  A  special  feature  of  the  Sasa- 
Satsuma  equation  is  that,  while  the  linearized  operator  of  this  equation  is  2X2,  its  spectral 
operator  is  3  X  3.  How  to  build  two-component  squared  eigenfunctions  from  three-component  Jost 
functions  is  an  interesting  and  curious  question.  As  we  shall  see,  only  certain  components  of  the 
3X3  Jost  functions  and  their  adjoints  are  used  to  construct  the  squared  eigenfunctions  and  the 
adjoint  squared  eigenfunctions  of  the  Sasa-Satsuma  equation.  To  calculate  these  squared  eigen¬ 
functions  for  the  Sasa-Satsuma  equation,  we  have  tried  to  directly  use  the  first  method  mentioned 
above.  The  recursion  operator  for  the  Sasa-Satsuma  equation  has  been  derived  recently.  How¬ 
ever,  that  operator  is  quite  complicated;  thus  its  eigenfunctions  are  difficult  to  obtain.  The  second 
method  mentioned  above  is  met  with  difficulties  too.  Thus  we  choose  the  third  method  for  the 
Sasa-Satsuma  equation  and  demonstrate  here  certain  advantages  of  this  method. 

In  this  paper,  we  further  develop  the  third  method  using  the  Sasa-Satsuma  equation  as  an 
example.  We  show  that  the  functions  appearing  in  the  expansion  of  the  variations  of  the  potentials 
are  always  the  squared  eigenfunctions  which  satisfy  the  linearized  equation  of  an  integrable 
system,  and  that  the  functions  appearing  in  the  formulas  for  the  variations  of  scattering  data  are 
always  the  adjoint  squared  eigenfunctions  which  satisfy  the  adjoint  linearized  equation  of  an 
integrable  system.  In  addition,  given  these  two  relations  between  the  variations  of  the  potentials 
and  the  variations  of  scattering  data,  there  naturally  follows  the  closure  relation  for  the  squared 
eigenfunctions  and  their  adjoints,  as  well  as  all  the  inner-product  relations  between  the  squared 
eigenfunctions  and  their  adjoints.  Thus  no  longer  is  it  necessary  to  grind  away  at  calculating  these 
inner  products  from  the  asymptotics  of  the  Jost  functions.  Rather  one  can  just  read  off  the  values 
of  the  nonzero  inner  products  from  these  two  variation  relations.  This  clarifies  the  long-standing 
question  regarding  squared  eigenfunctions  in  connection  with  the  linearized  integrable  equation 
and  streamlines  the  third  method  as  a  general  and  conceptually  simple  procedure  for  the  derivation 
of  squared  eigenfunctions.  We  apply  this  method  to  the  Sasa-Satsuma  equation  and  find  that  it 
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readily  gives  the  squared  eigenfunctions  and  adjoint  squared  eigenfunctions.  The  squared  eigen¬ 
functions  for  the  Sasa-Satsuma  equation  are  sums  of  two  terms,  with  each  term  being  a  product  of 
a  component  of  a  Jost  function  and  a  component  of  an  adjoint  Jost  function.  This  two-term 
structure  of  squared  eigenfunctions  is  caused  by  the  symmetry  properties  of  scattering  data  of  the 
Sasa-Satsuma  equation.  It  should  be  noted  that  our  derivation  uses  almost  exclusively  the  spectral 
operator  of  the  Lax  pair;  thus  all  integrable  equations  with  the  same  spectral  operator  (such  as  the 
Sasa-Satsuma  hierarchy)  will  share  the  same  set  of  squared  eigenfunctions  as  we  derived  here 
(except  for  a  time-dependent  factor  which  is  equation  specific).  An  additional  benefit  of  the 
method  we  used  is  that  for  two  integrable  equations  with  similar  spectral  operators,  the  derivation 
of  their  squared  eigenfunctions  will  be  essentially  the  same.  Thus  one  can  get  squared  eigenfunc¬ 
tions  for  one  equation  by  minor  modifications  for  the  other  equation.  As  an  example,  we  demon¬ 
strate  in  the  Appendix  how  squared  eigenfunctions  for  the  Manakov  equations  can  be  easily 
obtained  by  minor  modifications  of  our  calculations  for  the  Sasa-Satsuma  equation. 


II.  THE  RIEMANN-HILBERT  PROBLEM 

To  start  our  analysis,  we  first  formulate  the  Riemann-Hilbert  problem  for  the  Sasa-Satsuma 
equation  which  will  be  needed  for  later  calculations.  The  Sasa-Satsuma  equation  is 

u,  +  uxxx  +  h|w|2u,  +  3w(jw|2)r  =  0.  ( 1 ) 

Its  spectral  (scattering)  problem  of  the  Lax  pair  is^ 


Yx=-i£AY  +  QY< 


where  Y  is  a  matrix  function,  A=diag(l  ,1,-1), 


Q- 


I  0 
0 

\-«* 


0  u  ] 
0  u 
~u  0  j 


(2) 


(3) 


is  the  potential  matrix,  the  “  *”  represents  complex  conjugation,  and  f  is  a  spectral  parameter.  In 
this  paper,  we  always  assume  that  the  potential  u(x )  decays  to  zero  sufficiently  fast  as  x— >  ±*. 
Notice  that  this  matrix  Q  has  two  symmetry  properties.  One  is  that  it  is  anti-Hermitian,  i.e., 

=  -Q ,  where  the  superscript  *T’  represents  the  Hermitian  of  a  matrix.  The  other  one  is  that 


<tQ(t=Q\ 


Introducing  new  variables 


where  cr=  cr  1 


0  1  o\ 
10  0. 
\0  0  1  / 


then  Eq.  (2)  becomes 


J=YET\  £  =  <T*a\ 


(4) 


(5) 


Jx=-i£\J]  +  QJ, 


(6) 


where  [A,J]  =  A./-7A.  The  matrix  Jost  solutions  J±(;c,£)  of  Eq.  (6)  are  defined  by  the  asymptotics 


J±  — ►  1  as  x  — >  ±  ac,  (7) 

where  ‘*1”  is  the  unit  matrix.  Here  the  subscnpts  in  J±  refer  to  which  end  of  the  x  axis  the 
boundary  conditions  are  set.  Since  tr  2=0,  using  Abel’s  formula  on  Eq.  (6),  we  see  that  det  J± 
=  1  for  all  x.  In  addition,  since  J±E  are  both  solutions  of  the  linear  equations  (7),  they  are  not 
independent  and  are  linearly  related  by  the  scattering  matrix  S(f)=[s,7(f)]: 
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J_  =  J+ESE~\  det  S  =  1  (8) 

for  real  f,  i.e.,  f  e  H  (for  nonreal  £,  certain  elements  in  S  may  not  be  well  defined). 

Due  to  the  two  symmetry  properties  of  the  potential  matrix  Q ,  the  Jost  solutions  J±  and  the 
scattering  matrix  S  satisfy  the  corresponding  symmetry  properties.  One  is  the  involutive  property. 
Since  the  potential  Q  is  anti-Hermitian,  we  see  that  J±(C)  and  /2'(f)  both  satisfy  the  adjoint 
equation  of  (6).  In  addition,  we  see  from  Eq.  (7)  that  jt(£*)  and  have  the  same  large-r 

asymptotics;  thus  they  are  equal  to  each  other: 

jUC)=Jli(0-  (9) 

Then  in  view  of  Eq.  (8),  we  see  that 

5+(r)=s-'(a  do) 

To  derive  the  other  symmetry  properties  of  the  Jost  solutions  and  the  scattering  matrix,  we  notice 
that  due  to  the  symmetry  (4)  of  the  potential  Q ,  it  is  easy  to  see  that  (rJ*±(-£*)cr  also  satisfies  Eq. 
(6).  Then  in  view  of  the  asymptotics  (7),  we  find  that  Jost  solutions  possess  the  following  addi¬ 
tional  symmetry: 


J±(Q  =  <rJU-n<r-  (ID 

From  this  symmetry  and  relation  (8),  we  see  that  the  scattering  matrix  S  possesses  the  additional 
symmetry 

S(f)  =  o5*(-  C)(T.  (12) 

Analytical  properties  of  the  Jost  solutions  play  a  fundamental  role  in  the  Riemann-Hilbert 
formulation  for  the  scattering  problem  (6).  For  convenience,  we  express  J±  as 

=  <!>  =  [<£„&,&],  (13) 

y* 3],  (14) 

where  and  W  are  fundamental  solutions  of  the  original  spectral  problem  (2)  and  are  related  by 
the  scattering  matrix  as 

<P  =  VS.  (15) 

Then  in  view  of  the  boundary  conditions  (7),  the  spectral  equation  (6)  for  7^  can  be  rewritten  into 
the  following  Vol terra-type  integral  equations: 

•/_(£;*)  =  1+  j  e'{A{-'~x)Q(y)J_(^y)ei{Aix~y)dy,  (16) 


J+(£;x)=l-  f  e'{A(y-x,Q(y)Jj£;y)e'{A(x~y)dv.  (17) 

These  integral  equations  always  have  solutions  when  the  integrals  on  their  right  hand  sides  con¬ 
verge.  Due  to  the  structure  (3)  of  the  potential  0,  we  easily  see  that  Eq.  (16)  for  the  first  and 
second  columns  of  7_  contain  only  the  exponential  factor  e‘i(x~v)  which  decays  when  f  is  in  the 
upper  half-plane  C+,  and  Eq.  (17)  for  the  third  column  of  J+  contains  only  the  exponential  factor 
£<£(>-*)  which  also  falls  off  for  £  e  C+.  Thus  these  three  columns  can  be  analytically  extended  to 
f  e  C+.  In  other  words,  Jost  solutions 
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PM<t>^<i>*'hVlXx=JJH\+J+n1 


are  analytic  in  £eC+,  where 


W,  =  diag(  1.1,0),  H2  =  diag(0,0, 1). 


(18) 


(19) 


Here  the  subscnpt  in  P  refers  to  which  half-plane  the  functions  are  analytic  in.  From  the  Volterra 
integral  equations  for  P+,  we  see  that 

P+U,£)->1  as  f  e  C+  — ►  oc.  (20) 

Similarly,  Jost  functions  [«Ai ,  are  analytic  in  £  e  C_,  and  their  large-£  asymptotics  is 

[t/r|,t^,03]e'fAj:— ►  1  as  £  e  C_  — >  oc.  (21) 

To  obtain  the  analytic  counterpart  of  P+  in  C_,  we  consider  the  adjoint  spectral  equation  of  (6): 


Kx  =  -i£A,K\-KQ. 

The  inverse  matrices  J~j  satisfy  this  adjoint  equation.  Notice  that 

r_]  =E<P~\  r+=EV~'. 

Let  us  express  d>_l  and  as  a  collection  of  rows. 


01 

0i 

C|>‘  =  <F  = 

02 

II 

III 

7 

02 

.03. 

-03. 

(22) 


(23) 


(24) 


where  the  overbar  refers  to  the  adjoint  quantity;  then  by  similar  techniques  as  used  above,  we  can 
show  that  the  first  and  second  rows  of  JZl  and  the  third  row  of  are  analytic  in  £eC_,  i.e., 
adjoint  Jost  solutions 


P_  =  e 


-i£  Ax 


01 

02 

03 


=  H{r]  +  h2i;[ 


are  analy  tic  in  f  e  C_.  In  addition,  their  large-f  asymptotics  is 

P_U,f)^l  as  C 


(25) 


(26) 


Similarly,  the  first  and  second  rows  of  and  the  third  row  of  J_\  i.e.,  e  and  3, 

are  analytic  in  £  e  C+,  and  their  large-£  asymptotics  is 


-/£A.r 


0i 

02 

03 


>  1  as  f  e  C+  ■ 


(27) 


In  view  of  the  involution  properties  (9)  of  7±,  we  see  that  the  analytic  solutions  P±  satisfy  the 
involutive  property  as  well: 


Pl(C)  =  PM). 


(28) 


This  property  can  be  taken  as  a  definition  of  the  analytic  function  P_  from  the  known  analytic 
function  P+. 
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The  analytic  properties  of  Jost  functions  described  above  have  immediate  implications  on  the 
analytic  properties  of  the  scattering  matrix  5.  Let  us  denote 

S-'(f)  =  S(£)  =  [*};(£)].  (29) 

Then  since 


<f>\ 

5  =  vJ/-'(J)  = 

^2 

5=0-'^  = 

<t>2 

>3. 

(30) 


we  see  immediately  that  sn,  s12,  s2l ,  s2 2<  ar|d  S33  can  t>e  analytically  extended  to  the  upper 
half-plane  feC+,  while  Jn,  s12,  J2i»  $22,  and  *33  can  be  analytically  extended  to  the  lower  half¬ 
plane  f  e  C_.  In  addition,  their  large-f  asymptotics  are 


mi  M2 
s2\  s22 


1,  i33  —  1  as  f  e  C+ 


(31) 


and 


Mi  s\2 

Mi  S22 


1,  s  33  — ►  1  as  (  e  C_ 


(32) 


Hence  we  have  constructed  two  matrix  functions  and  P_  which  are  analytic  in  C+  and  C_, 
respectively.  On  the  real  line,  using  Eqs.  (8),  (18),  (25),  and  (29),  we  easily  see  that 


P-U)P+(0  =  GUI  ie/?, 


(33) 


where 


1  0  S-\ 


£~  . 


(34) 


M3 

G  =  E(H\  +  h2s)(h]  +  s~]h2)e~]  =  e|  0  i  r23 

1*31  *32  1  / 

Equation  (33)  determines  a  matrix  Riemann-Hilbert  problem.  The  normalization  condition  for  this 
Riemann-Hilbert  problem  can  be  seen  from  (20)  and  (26)  as 


P±U,f)  — >  1  as  f  e  C±  — >  oc.  (35) 

If  this  problem  can  be  solved,  then  the  potential  Q  can  be  reconstructed  from  an  asymptotic 
expansion  of  its  solution  for  large  f.  Indeed,  writing  as 

p+(  x,o = 1 + r'p{l\x) + r2Pi2)(x) + o(r3).  (36) 

inserting  it  into  Eq.  (6),  and  comparing  terms  of  the  same  order  in  we  find  at  0(1)  that 

0  =  i'[A,P(l)].  (37) 

Thus  the  potential  Q  can  be  reconstructed  from  P(l).  At  0(£“'),  we  find  that 

PlJ)  =  -i\\,P(2)]+QPM.  (38) 

From  the  above  two  equations  as  well  as  the  large-jc  asymptotics  of  P+(jc,£)  from  Eq.  (7),  we  see 
that  the  full  matrix  P(1)  is 
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f  l«(y)IVy 

J  -OC 

J  u2(y)dy 

J  -oc 

u(x) 

P,nw  =  i 
2 1 

f  u2{y)dy 

J  -oc 

f  |«(y)|2rfy 

J  — oc 

u‘(x) 

(39) 

u(x) 

w(r)  ' 

2  f  |ii(j’)|2rfy 

J  X 

This  matrix,  together  with  Eq.  (36),  gives  the  leading-order  asymptotic  expansions  for  the  ana¬ 
lytical  functions  P+.  Leading-order  asymptotic  expansions  for  P_  can  be  similarly  derived. 

Elements  of  the  scattering  matrix  5  are  not  all  independent.  Indeed,  since  the  potential  matrix 
Q  contains  only  one  independent  function  m(jc),  matrix  5  should  contain  only  one  independent 
element  as  well.  To  get  the  dependence  of  scattering  coefficients,  we  first  notice  from  equation 
55=  1  that 


*3 3(0*33(0  =  1  -  *3l(0*l3(0  -  *32(0*23(0.  C  €  (40) 

Since  s33  and  J33  are  analytic  in  C_  and  C+,  respectively,  the  above  equation  defines  a  (scalar) 
Riemann-Hilbert  problem  for  (533, T33)  under  the  canonical  normalization  conditions  which  can  be 
seen  from  (31 )  and  (32).  Thus  (533,533)  can  be  uniquely  determined  by  the  locations  of  their  zeros 
in  C±  together  with  the  scattering  data  (53|,532,513,523)  on  the  real  line  R  by  the  Plemelj 
formula.  Similarly,  from  equation  55=1  we  have 


/*n  *12 
'*21  522 


/*11  s\2 
52i  5  22 


=  l-^n)(^31^32). 


f«R. 


(41) 


which  defines  another  (matrix)  Riemann-Hilbert  problem  for  (in*  *12  **21  **22)  and 
(*ii»*12» *21**22)  under  the  canonical  normalization  conditions  which  can  be  seen  from  (31)  and 
(32).  Thus  these  analytical  scattering  elements  can  be  determined  by  the  scattering  data 
(*31**32**13**23)  on  the  real  line  f e  R  as  well.  The  scattering  data  (531 ,532,513,523)  themselves  are 
dependent  on  each  other  by  the  symmetry  relations  (10)  and  (12).  Specifically,  for  £e  R,  513(£) 
=*;,(£),  and  s32(£)=S3,  (-£).  Thus  the  scattering  matrix  5  indeed  contains  only  a 

single  independent  element  as  we  would  expect. 

In  the  next  section,  we  will  derive  squared  eigenfunctions  for  the  Sasa-Satsuma  equation.  For 
the  convenience  of  that  derivation,  we  introduce  the  following  notations: 


*31 

*32 

*13 

*23 

f 

111 

<N 

Pi  -  —  * 

P2=  — 

*33 

*33 

*33 

*33 

(42) 


Due  to  the  symmetry  conditions  (10)  and  (12)  of  the  scattering  matrix  5,  we  see  that  pk  and  pk 
satisfy  the  symmetry  conditions 


p*(0-pl(0.*=1.2.  fc(0-P*(-0.  P2(0  =  Pi(-0.  feR.  (43) 

In  the  next  section,  symmetry  properties  of  and  will  also  be  needed.  Due  to  the  symmetry 
conditions  (9)  and  (11)  of  the  Jost  solutions,  4>  and  ¥  satisfy  the  symmetry  conditions 

a>t(r)  =  4>-'(f),  (44) 

and 

*±(0  =  o4C(-  n*  ^±(0  =  Do-.  (45) 

Symmetry  (44)  means  that 
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<t>\(C)=Ul\  ip[(C)=4>kU).  *=  1.2,3,  (46) 

while  symmetry  (45),  together  with  (46),  means  that 

<r<Mf)  =$(-£),  v<htt)  =  #(-  <*&(£)«$(-£).  (47) 

Here  the  superscript  T  represents  the  transpose  of  a  matrix.  These  symmetry  properties  will  be 
important  for  deriving  the  final  expressions  of  squared  eigenfunctions  for  the  Sasa-Satsuma  equa¬ 
tion  in  the  next  section. 


III.  SQUARED  EIGENFUNCTIONS  AND  THEIR  CLOSURE  RELATION 

In  this  section,  we  calculate  the  variation  of  the  potential  via  variations  of  the  scattering  data, 
then  calculate  variations  of  the  scattering  data  via  the  variation  of  the  potential.  The  first  step  will 
yield  squared  eigenfunctions,  and  it  will  be  done  by  the  Riemann-Hilbert  method.  The  second  step 
will  yield  adjoint  squared  eigenfunctions,  and  it  will  be  done  using  basic  relations  of  the  spectral 
problem  (2).  For  the  ease  of  presentation,  we  first  assume  that  s33  and  s33  have  no  zeros  in  their 
respective  planes  of  analyticity,  i.e.,  the  spectral  problem  (2)  has  no  discrete  eigenvalues.  This 
facilitates  the  derivation  of  squared  eigenfunctions.  The  results  for  the  general  case  of  533  and  s33 
having  zeros  will  be  given  at  the  end  of  this  section. 


A.  Variation  of  the  potential  and  squared  eigenfunctions 

In  this  subsection,  we  derive  the  variation  of  the  potential  via  variations  of  the  scattering  data, 
which  will  readily  yield  the  squared  eigenfunctions  satisfying  the  linearized  Sasa-Satsuma  equa¬ 
tion.  Our  derivation  will  be  based  on  the  Riemann-Hilbert  method. 

To  proceed,  we  define  the  following  matrix  functions 

F+  =  P+  diag(  1,1,3-  ),  F_=P;' diagd.l.Sjj).  (48) 

\  sn' 

The  reason  to  introduce  diagonal  matrices  with  s33  and  s33  in  F±  is  to  obtain  a  new  Riemann- 
Hilbert  problem  (49)  with  a  connection  matrix  G  which  depends  on  pk  and  pk  rather  than  and 
Sjj.  This  way,  the  variation  of  the  potential  will  be  expressed  in  terms  of  variations  in  pk  and  pk. 
When  s33  and  s33  have  no  zeros  in  C+  and  C_,  respectively,  then  F±  as  well  as  F~J  are  analytic  in 
C±.  On  the  real  line,  they  are  related  by 


F+(0  =  F-(0GUI  f  6  R,  (49) 

where 


1 


533 


G  =  diagl  1,1, —  G  diag (  1,1,—  J  =  E 


1 


*33 


i  o  p, 

0  1  p2 

\Pl  Pi  l+PlPl+P2P2^ 


\Erl. 


(50) 


Here  relation  (40)  has  been  used.  Equation  (49)  defines  a  regular  Riemann-Hilbert  problem  (i.e., 
without  zeros). 

Next  we  take  the  variation  of  the  Riemann-Hilbert  problem  (49)  and  get 


SF+  =  SF_G  +  F_SG,  (  e  R. 
Utilizing  Eq.  (49),  we  can  rewrite  the  above  equation  as 


(51) 
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SF+rJ  =  SF.r'  +  F.SGrJ,  {eR,  (52) 

which  defines  yet  another  regular  Riemann-Hilbert  problem  for  SFF~l.  Unlike  the  previous 
Riemann-Hilbert  problems  (33)  and  (49)  which  were  in  matrix  product  forms,  the  present 
Riemann-Hilbert  problem  (52)  can  be  explicitly  solved.  Using  the  Plemelj  formula,  the  general 
solution  of  this  Riemann-Hilbert  problem  is 


6Fr'((\x)  =  A0(x)  + 


JLf 

2v i  J_. 


H(g;x) 

(-( 


(53) 


where 


II(f;jr)  -  F_(£x)8G(frx)r+l(&x),  feR. 


(54) 


Now  we  consider  the  large-f  asymptotics  of  this  solution.  Notice  from  expansions  (36)  and  (31) 
and  relation  (48)  that  as  f— +x. 


F+((\x)- 


1+01“ 


O 


■  C 
u(x) 

2  a 


u(x) 
2  if 


uW 

2  a 

u(x) 

2  i( 

,  +  o(i 


«F+(f«) 


01 J 

Su*{x)  Su(x) 

HuT  lijT 


O 


1  ]  Su(x) 

Su*(x) 
2  i( 


In  addition,  it  is  easy  to  see  that 


P  n(£;jc)  1  P 

n  (€;xW. 

J  -oo  S  b  S  J  -x 


(55) 


(56) 


When  these  large-f  expansions  are  substituted  into  Eq.  (53),  at  0(1),  we  find  that  A0(x)=0.  At 
O(p'),  we  get 


<$t(jc) 


=  - -  J  ni3(£*)df  &/'(x)  =  --f  Ili,(£;x)d£.  (57) 

^  J  — oc  ^  J  — x 


Now  we  calculate  the  elements  II i3  and  n3j.  From  Eq.  (50),  we  see  that 


SG-E 


0  0 


<5 p, 

0  0  8 p2  |e“' 

\8p,  Sp2  P]8pt  +  p,8pi  +  p28p2  +  p28p2l 


(58) 


Then  in  view  of  the  F±  definitions  (48)  and  P±  expressions  (18)  and  (25),  we  find  from  (54)  that 

ri(f ;jc)  =  J.E(HX  +  H2S)-]  diag(  1 , 1  ,s33)ET] SGE  diag(  1 ,  1 ,  J33)(//,  +  S~]H2y]Er]r\  (59) 
which  simplifies  to 
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n(f;jc)  =  <D 


(  0 

0 

Wi 


0 

0 

Sp2 


$0 

Sp2 

0  / 


<D. 


(60) 


This  relation,  together  with  Eq.  (57),  shows  that  the  variation  of  the  potential  Su  can  be  expanded 
into  quadratic  combinations  between  Jost  solutions  and  adjoint  Jost  solutions  <t>.  This  relation  is 
generic  in  integrable  systems. 

Now  we  calculate  explicit  expressions  for  Su.  Inserting  (60)  into  (57)  and  recalling  our 
notations  (24),  we  readily  find  that 


i  r  - 

Su--  I  (0310|3^Pl  +  031023<^P2 +  011033<?Pl  +  021  033  ^Pl)d€-  (61) 

TTJ_  x 


Here  the  notations  are 


0*i 

<A*i 

0i k- 

0*2 

■  0*  = 

<A*2 

.0*3. 

>*3. 

.  4>k  =  [<t,kb<t,k2'4>k-<l  'Pk  =  [>Pk\>'l'k2*'Pa\  k=  1,2,3.  (62) 

Utilizing  the  symmetry  relations  (43)  and  (47),  the  above  Su  formula  reduces  to 
1  fx  _ 

!  =  -~I  [(031013  +  033012)  Sp\  +  (<f>\  1033  +  013032)  ^l]^£*  (63) 

TTj-o c 


Su  ■ 


This  is  an  important  step  in  our  derivation,  where  symmetry  conditions  play  an  important  role. 
Regarding  Su *,  its  formula  can  be  obtained  by  taking  the  complex  conjugate  of  the  above  equation 
and  simplified  by  using  the  symmetry  relations  (46).  Defining  functions 


Z,= 


031 013  +  033012 
_  03 2013  +  03301 1  _ 


z2  = 


01 1  033  +  01  3032 
_  012033  +  013031  _ 


then  the  final  expression  for  the  variation  of  the  potential  ( Su,Su*)T  is 


Su(x) 

Su*(x) 


(&x)Spx(&  +  Z2((;x)Sf>l(&]dt 


(64) 


(65) 


Notice  here  that  due  to  the  symmetry  relations  (46),  Z2(£)  is  equal  to  2T|(£* )  with  its  two  compo¬ 
nents  swapped.  Also  notice  that  Z,  and  Z2  are  the  sum  of  two  terms,  where  each  term  is  a  product 
of  a  component  of  a  Jost  function  and  a  component  of  an  adjoint  Jost  function.  The  two-term 
feature  of  these  functions  is  caused  by  the  symmetry  properties  of  the  scattering  data  and  Jost 
functions,  which  enable  us  to  combine  terms  together  in  the  general  expansion  (61 ).  In  the  massive 
Thirring  model,  the  counterparts  of  functions  ZX  2  are  also  sums  of  two  terms.4  The  two-term 
structure  there  is  not  due  to  symmetry  properties  but  rather  due  to  the  asymptotics  of  its  Jost 
functions  in  the  spectral  plane.  The  feature  of  each  term  in  Zj  and  Z2  being  a  product  between  a 
Jost  function  and  an  adjoint  Jost  function,  on  the  other  hand,  is  a  generic  feature  in  integrable 
systems.  In  previous  studies  on  many  integrable  equations  (such  as  the  KdV,  NLS,  derivative  NLS, 
and  massive  Thirring  equations),  it  was  found  that  these  functions  were  often  “squares”  or  prod¬ 
ucts  of  Jost  functions  themselves  (thus  the  name  “squared  eigenfunctions”).1, 3,4,1 6  This  was  so 
simply  because  the  spectral  operators  in  those  systems  were  2X2,  for  which  the  adjoint  Jost 
functions  (rows  of  the  inverse  of  the  Jost  function  matrix)  are  directly  proportional  to  Jost  func¬ 
tions  themselves.  That  is  not  generic,  however,  and  does  not  hold  for  the  Sasa-Satsuma  equation 
or  in  general  for  integrable  equations  whose  spectral  operator  is  3  X  3  or  higher. 

The  derivation  of  (65)  for  the  expansion  of  (Su,  Su*)T  is  an  important  result  of  this  subsection. 
It  readily  gives  the  variations  in  the  Sasa-Satsuma  fields  in  terms  of  variations  in  the  initial  data. 
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To  show  this,  we  first  restore  the  time  dependence  in  Eq.  (65).  An  important  fact  we  need  to  notice 
here  is  that  the  Jost  solutions  <J>  as  defined  in  (13)  do  not  satisfy  the  time  evolution  equation  of  the 
Lax  pair  of  the  Sasa-Satsuma  equation.  Indeed,  this  time  evolution  equation  of  the  Lax  pair  is?1 

Y,=  -4i?\Y+V((,u)Y,  (66) 

where  the  matrix  V  goes  to  zero  as  x  approaches  infinity.  Obviously  the  large -j  asymptotics  of  O 
[see  (7)  and  (13)]  cannot  satisfy  the  above  equation  as  |jc|  — ►  ^  where  V  vanishes.  But  this  problem 
can  be  easily  fixed.  Defining  the  “time-dependent”  Jost  functions 

O'')  =  (67) 

then  these  functions  satisfy  both  parts  of  the  Lax  pair,  (2)  and  (66).  The  reason  they  now  satisfy 
the  time  evolution  equation  (66)  is  due  to  their  satisfying  the  asymptotic  time  evolution  equation 
(66)  as  |jc|  — ►  oo  as  well  as  the  compatibility  relation  of  the  Lax  pair.  Similarly  we  define  the 
time-dependent  adjoint  Jost  functions  as 


<j>W  =  e4'fJA'0,  (68) 

which  satisfy  both  adjoint  equations  of  the  Lax  pair.  Now  we  use  these  new  Jost  functions  and 
adjoint  Jost  functions  to  replace  those  in  the  definitions  (64)  and  get  time -dependent  (Zl,Z2) 
functions 


A')= 


ot)  jit)  ,  .it)  jit) 

03  10]  3  +  033012 

jit)  jit)  ,  jiOjit) 

L  032013  +  03301 1  - 


4'*= 


L«+«J 


(69) 


In  view  of  relations  (67)  and  (68),  we  see  that  these  time-dependent  (Zj,^)  functions  are  related 
to  the  original  ones  as 


zf  =  Z,  e8iC\  Zj1  =  Z2e-8i{3'.  (70) 

Another  fact  we  need  to  notice  in  conjunction  with  the  time-restored  equation  of  (65)  is  that 
for  the  Sasa-Satsuma  equation,  the  time  evolution  of  the  scattering  matrix  5  is  given  by 

S,  =  -4»£3[A,S].  (71) 

Thus 


4(r)  =  0,  s;,(f)  =  8rf3s3I(/),  (72) 

where  the  prime  indicates  differentiation  with  respect  to  t.  Then  recalling  definition  (42)  of  (p,  ,pj) 
as  well  as  the  symmetry  property  (10),  we  obtain  the  time  evolution  of  the  varied  scattering  data 
as 


8pl(t,t)=Spl(lO)e*i(3',  Spl(lt)  =  8pl(l  0)e~8i(}'.  (73) 

Now  we  insert  the  above  time-dependence  relations  (70)  and  (73)  into  the  time-restored 
expansion  (65)  and  get  a  new  expansion, 

=  --[  [Zf,({yr.r)4>i(f.O)  +  2$,({;ji;rt45I(fcO)]£/f.  (74) 

tt  J  _oc 

The  advantages  of  this  expansion  are  that  the  Jost  functions  d>(/)  in  [Z^\Z^]  satisfy  both  equations 
of  the  Lax  pair,  and  the  expansion  coefficients  [<5p|,<$p,]  are  time  independent. 

Since  Su  and  Su*  must  satisfy  the  homogeneous  linearized  Sasa-Satsuma  equation,  it  follows 
that  the  functions  [Z^,Z^]  appearing  in  expansion  (74)  must  also  and  precisely  satisfy  the  same 
linearized  Sasa-Satsuma  equation.  A  similar  fact  for  several  other  integrable  equations  has  been 


Su(xj) 
Su*(x  J) 
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noted  before  by  direct  verification  (see  Ref.  6,  for  instance).  Here  we  will  give  a  simple  proof  of 
this  fact  which  holds  for  any  integrable  equation  where  an  expansion  such  as  (74)  exists.  Suppose 
the  linearized  operator  of  the  Sasa-Satsuma  equation  for  [u(x,t),u*(x,t)]T  is  C.  Since  the  potential 
u(x,t)  satisfies  the  Sasa-Satsuma  equation  and  the  variation  of  the  potential  [&/(.*,/),  <5w*(jc,/)]r  is 
infinitesimal,  it  follows  that 


Su(xj) 

Su*(xj) 


=  0. 


Inserting  Eq.  (74)  into  the  above  equation,  we  get 


(75) 


U./)<W£0)  +  /^(fc*./)Jp|(fc0)]d{«0. 


(76) 


Since  the  initial  variations  of  the  scattering  data  (<5pi,£pi)(£,0)  are  arbitrary  and  linearly  inde¬ 
pendent,  it  follows  that 


CZ<{)(Z\xj)  =  Cz¥(Z\x,t)  =  0  (77) 

for  any  £  e  R.  Thus  Z^’  and  Z!/1  satisfy  the  linean/ed  Sasa-Satsuma  equation  and  are  therefore  the 
squared  eigenfunctions  of  this  equation. 

Now  in  the  derivation  of  (74),  from  (52)  forward,  no  constraints  have  been  put  on  the  varia¬ 
tions  of  u  and  w*,  except  for  the  implied  assumption  that  the  scattering  data  for  u  and  u*  and  its 
variations  will  exist.  Whence  we  expect  that  [Su,Su*]T  in  (74)  can  be  considered  to  be  arbitrary. 
If  so,  then  due  to  the  equality  which  we  see  in  (74),  we  already  can  expect  {Z^(£; jt,/),Zv 
X(£;jc,/),£e  R}  to  form  a  complete  set  in  an  appropriate  functional  space  (such  as  L{). 

We  remark  that  the  squared  eigenfunctions  given  here  are  all  linearly  independent.  That  can  be 
easily  verified  upon  recognizing  that  from  (2),  one  may  readily  construct  an  integrodifferential 
eigenvalue  problem  for  which  these  squared  eigenfunctions  are  the  true  eigenfunctions.  Then  in 
the  usual  manner  it  follows  that  these  squared  eigenfunctions  are  linearly  independent.  Similar  for 
the  adjoint  squared  eigenfunctions.  The  linear  independence  of  these  squared  eigenfunctions  can 
also  be  easily  established  after  we  have  obtained  the  dual  relations  of  (74)  which  give  variations 
of  scattering  data  due  to  arbitrary  variations  of  the  potentials  (see  next  section). 

The  squared  eigenfunctions  [Z^\Z^]  have  nice  analytic  properties.  Indeed,  recalling  the 
analytic  properties  of  Jost  functions  discussed  in  Sec.  II,  we  see  that  Z^(f)*“2^  is  analytic  in  C_, 
while  is  analytic  in  C+.  These  analytic  properties  will  be  essential  when  we  extend  our 

results  to  the  general  case  where  s33  and  s33  have  zeros  (see  end  of  this  section). 

It  should  be  noted  that  in  expressions  (69)  for  [2^,2^],  if  is  replaced  by  other  Jost 
functions  (such  as  4>\)  or  if  is  replaced  by  other  adjoint  Jost  functions  (such  as  <f>^\  the 
resulting  functions  would  still  satisfy  the  linearized  Sasa-Satsuma  equation  and  are  thus  also 
squared  eigenfunctions  of  this  equation.  These  facts  can  be  verified  directly  by  inserting  such 
functions  into  the  linearized  Sasa-Satsuma  equation  and  noticing  that  and  satisfy  the  Lax 
pair  (2)  and  (66)  and  the  adjoint  Lax  pair,  respectively.  Similar  results  also  hold  for  other  inte¬ 
grable  equations.  However,  these  other  squared  eigenfunctions  in  general  do  not  have  nice  analytic 
properties  in  C±  nor  are  they  linearly  independent  of  set  (69). 


B.  Variations  of  the  scattering  data  and  adjoint  squared  eigenfunctions 

In  this  subsection,  we  calculate  variations  of  the  scattering  data  which  occur  due  to  arbitrary 
variations  in  the  potentials.  These  formulas,  together  with  Eq.  (65)  or  Eq.  (74),  will  give  the 
“adjoint"  form  of  the  squared  eigenfunctions  as  well  as  their  closure  relation. 

We  start  with  the  spectral  equation  (2)  or  Jost  functions  <t>  or,  specifically. 
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d>,=  -/fAO  +  0O.  (78) 

Taking  the  variation  to  this  equation,  we  get 

S$>x  =  -itAd$>  +  Q8$>+  SQ<t> .  (79) 

Recalling  that  <t>— >e~l^x  as  x— >-a>  [see  Eqs.  (7)  and  (13)],  thus  ►O  as  x— >-x.  As  a  result, 
the  solution  of  the  inhomogeneous  equation  (79)  can  be  found  by  the  method  of  variation  of 
parameters  as 


<t>  <$£?(>')$(£;. vKy. 


(80) 


Here 


SQ  = 

V 


0 

0 

Su* 


0  Su 
0  Su 
- Su  0 


(81) 


is  the  variation  of  the  potential.  Now  we  take  the  limit  of  x— in  the  above  equation.  Recalling 
<t>  =  'ES  and  the  asymptotics  of  XE— ►  £  as  jc— ►  »,  we  see  that  <t >— ►ES,  S$>  — >ESS  as  jc— ►x.  Thus  in 
this  limit,  the  above  equation  becomes 


<W£)  =  S(£)  {«R.  (82) 

J  —oo 

Noticing  that  StfHs'K"'  =  4^  the  above  equation  can  be  rewritten  as 

SS(£)=  j  V(bx)SQ(x)<t>(l,x)dx,  |eR.  (83) 

J  —oo 

This  formula  gives  variations  of  scattering  coefficients  Ssjj  via  the  variation  of  the  potential  SQ. 
From  it,  the  variation  of  the  scattering  data  Spx  can  be  readily  found.  This  Spx  formula  contains 
both  Jost  functions  <t>  and  adjoint  Jost  functions  'E.  When  we  further  express  <f>  in  terms  of  'F 
through  relation  (15)  (so  that  the  boundary  conditions  of  the  Jost  functions  involved  are  all  set  at 
;t=+x),  we  obtain  the  expression  for  Spx  as 


Sp 


-H 


[<&(  (At  ,  —  (A33<P2)+  <*>«*(  V'32<P3  ~  , 


where  the  column  vector  <p  is  dehned  as 


(84) 


(f>=[(Pl,(f>2,<P3]r--S22<Al  --S21<A2-  (85) 

The  variation  <5p|  can  be  derived  by  taking  the  complex  conjugate  of  the  above  equation  and 
utilizing  the  symmetry  relations  (43)  and  (46).  Defining  functions 


9~  \.9l<92'9i]  —  s22tl,\  -  s\2tl>2-' 

<fal9 3  - 

^31^3  —  033^2  _ 

and  inner  products 


fl,  = 


<fa\93~  <ki92 
4>2293  ~  &3<Pl 


f l,= 


(86) 

(87) 
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f(x)g(x)dx , 


then  the  final  expressions  for  variations  of  the  scattering  data  <5p,  and  Sp]  are 


SpM 


(88) 


(89) 


<5p,(f)  =  ^-(n2(fu), 

533(£)  \ 

where  £  e  H . 

The  above  two  equations  are  associated  with  expansion  (65)  of  the  previous  section,  where 
time  is  frozen  in  both  cases.  A  small  problem  with  these  equations  is  that  the  Jost  functions  and 
adjoint  Jost  functions  appearing  in  definitions  (87)  of  do  not  satisfy  the  time  evolution 

equation  (66)  of  the  Lax  pair  (see  the  previous  section).  To  fix  this  problem,  we  repeat  the  practice 
of  the  previous  section  and  introduce  time-dependent  versions  of  the  functions  [fl|,fl2]  as 


Su(  x) 
Su'(x) 


(90) 


n(,0= 


SMSJStf  n,„_ 

where  <p(f)(£»-*T)  and  <p{t\^x,t)  are  defined  as 

<pU)  =  S22{&^ I*’  -  s2 i(f)^2)’  <PU)  -  s22(f)^l'1  -  Si2(&t^\ 

<t>u)  and  have  been  defined  in  (67)  and  (68),  and  Hr(r)  and  are  defined  as 


(91) 


(92) 


\p(0  =  \p>-4/£3Ar  _  e4iC*\tqr  (93) 

Notice  from  (71)  and  the  symmetry  condition  (10)  that  (512^22^21^22)  are  time  independent. 
Similar  to  what  we  have  done  in  the  previous  section,  we  can  show  that  <p{t)  and  in  definition 
(91)  are  Jost  functions  satisfying  both  the  Lax  pair  (2)  and  (66),  while  functions  <p^  and  'P^ 
satisfy  both  the  adjoint  Lax  pair.  In  addition,  [O,  are  related  to  [fl,,{l2]  as 

n^-n^A  n^=n2eii{3'.  (94) 

Inserting  this  relation  and  (73)  into  the  time-restored  equations  (89)  and  (90),  we  get  the  new 
relations 


<W£0) 


Su(xj) 

&/*(jt,r) 


(95) 


<5pi(£0) 


nM{;x,t), 


Su(xj) 

Su*(xj) 


(96) 


These  two  new  relations  are  associated  with  the  new  expansion  (74)  in  the  previous  section. 

The  functions  appearing  in  the  above  variations  of  scattering  data  formulas  (95) 

and  (96)  are  precisely  adjoint  squared  eigenfunctions  satisfying  the  adjoint  linearized  Sasa- 
Satsuma  equation.  Analogous  facts  for  several  other  integrable  equations  have  been  noted  before 
by  direct  verification  (see  Ref.  6).  Below  we  will  present  a  general  proof  of  this  fact  which  will 
hold  for  those  integrable  equations  where  we  have  relations  similar  to  (74),  (95),  and  (96).  Along 
the  way,  we  will  also  obtain  the  closure  relation,  orthogonality  relations,  and  inner  products 
between  the  squared  eigenfunctions  and  the  adjoint  squared  eigenfunctions. 
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First,  we  insert  Eqs.  (95)  and  (96)  into  expansion  (74).  Exchanging  the  order  of  integration, 
we  get 


Su(xj) 
Su*(; t,  t) 


-irr 

^  J  —  QC  J  — X 


x(ffx.r)n?r(ftjr',r) 


dt 


Su(x'j) 
_6u*(x'j)  J 


dx’. 


(97) 


Since  we  are  considering  that  (Su,Su*)  are  arbitrary  localized  functions  (in  an  appropriate  func¬ 
tional  space  such  as  L,),  in  order  for  the  above  equation  to  hold,  we  must  have 


533(£) 


4)(£;jr,f)n(1',r(£u',f)  + 


jtfl 


dt=%x-x')  1, 


(98) 


which  is  the  closure  relation  (discrete-spectrum  contributions  are  absent  here  due  to  our  assump¬ 
tion  of  533,  533  having  no  zeros).  Here  ($U)  is  the  Dirac  delta  function.  This  closure  relation  has  the 
usually  symmetry  in  that  not  only  do  the  squared  eigenfunctions  e  K} 

form  a  complete  set  but  also  the  adjoint  functions  *(£’*’ *)*£ G  form  a  complete 

set  as  well. 

To  obtain  the  inner  products  and  orthogonality  relations  between  these  functions,  we  reverse 
the  above  and  insert  expansion  (74)  into  Eqs.  (95)  and  (96).  Exchanging  the  order  of  integration, 
we  get 


<%>■(£< 

L<5pi(£< 


■  o)  r 

■o)J  J_x 


dpi(f\ 0) 
<5pi(f,0) 


(99) 


where 


1 


<(l\ ,)(f;jr.r).Z?><f;xf)) 


\ 


™33<£) 


7 


(100) 


Since  relation  (99)  is  taken  to  be  valid  for  arbitrary  functions  of  £p,(£,0)  and  <5p,(£, 0), 

*£-£').  (101) 


►equently 

[Z^Z^1  Am/<tinnr  rriM 


Consequently  we  get  the  inner  products  and  orthogonality  relations  between  squared  eigenfunc¬ 
tions 


]  and  functions  [fi  j  ,0!,'  ]  as 


<fl?,(£;jt.r).zy,(f  u.f)>  =  -  f ). 


(102) 


(103) 


(ni')($;jt.r).4',(fu.f)>  =  <nS!')(^f),Z(l')(fu.f))  =  0  (104) 

for  any  e  H. 

To  show  that  (n,,fl2)  are  adjoint  squared  eigenfunctions  which  satisfy  the  adjoint  linearized 
equation  of  the  Sasa-Satsuma  equation,  we  first  separate  the  temporal  derivatives  from  the  spatial 
ones  in  the  linearized  operator  C  as 
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£  =  L?,  +  L, 


(105) 


where  L  only  involves  spatial  derivatives.  The  adjoint  linearized  operator  is  then 

CA  =  -  l#9f  +  La  ,  (106) 

where  LA  is  the  adjoint  operator  of  L.  Now  we  take  the  inner  product  between  the  equation 
£Z*,r)(£' ; jc, r)  =  0  and  function  When  Eq.  (105)  is  inserted  into  it,  we  find  that 

{L^(n  n(, '>(£))  -  = o.  ( 1 07) 

Here  and  immediately  below,  the  (jc.f)  dependence  of  2  and  fl,  2  is  suppressed  for  notational 
simplicity.  Recall  that  533  is  time  independent  for  the  Sasa-Satsuma  equation;  thus  from  the 
inner-product  equation  (102)  we  see  that  <9,(Z^(£'),ftif)(0)=O.  Using  integration  by  parts,  the  first 
term  in  the  above  equation  can  be  rewritten  as 

n?>(o> + w(f ,0i;±,  *•  -  *.  nos) 

where  function  W(£'  ,£;jt,f)  contains  terms  which  are  generated  during  integration  by  parts.  These 
terms  are  quadratic  combinations  of  Z*,r)(£'),  ftjr)(0,  and  their  spatial  derivatives.  When  x=  ±L 
—►±oc,  each  term  has  the  form /(0)£(0exp(±/£'L±/£L±4/03r±4/£3r)  due  to  the  large-x  as¬ 
ymptotics  of  Jost  functions  O  and  Her e/(f')  and  g(0  are  related  to  scattering  coefficients  and 
are  continuous  functions.  Then  in  the  sense  of  generalized  functions,  the  last  term  in  Eq.  (108)  is 
zero  due  to  the  Riemann-Lebesgue  lemma.  Inserting  the  above  results  into  Eq.  (107),  we  find  that 

(Zf{\nCA^))  =  0,  £\{e  R,  (109) 

i.e.,  CaQ}(\()  is  orthogonal  to  all  Z^(0).  By  taking  the  inner  product  between  the  equation 
£Z(2r)(£')  =  0  and  function  0^(0  and  doing  similar  calculations,  we  readily  find  that 

<Z(2,)(f)X^n(/)(0)  =  0,  £',£<=  R.  (110) 

In  other  words,  £^(1^(0  is  also  orthogonal  to  all  zif)(£').  Since  {Z^(£'),Z^(  £'),£'  e  R}  forms  a 
complete  set,  Eqs.  (109)  and  (1 10)  dictate  that  £*ftj  has  to  be  zero  for  any  and  r,  i.e., 

£4n(1r)(0x,r)  =  0,  ?eB.  (Ill) 

Thus  ft is  an  adjoint  squared  eigenfunction  that  satisfies  the  adjoint  linearized  Sasa-Satsuma 
equation  for  every  £  e  R.  Similarly,  it  can  be  shown  that  ft2r)(0  is  also  an  adjoint  squared  eigen¬ 
function  for  every  fell.  In  short,  {ft^^ft^fh^  e  R}  is  a  complete  set  of  adjoint  squared 
eigenfunctions. 

Like  squared  eigenfunctions  (Z^,f),z!,r)),  these  adjoint  squared  eigenfunctions  (ft^ft^)  are 
also  quadratic  combinations  of  Jost  functions  and  adjoint  Jost  functions.  In  addition,  they  have 
nice  analytic  properties  as  well.  To  see  this  latter  fact,  notice  from  definition  (85)  that  (pe‘^x  is 
analytic  in  C_  since  s21,s22, ip\e‘-x  and  ip2e^x  are  analytic  in  CL.  Then  recalling  that  ip 3e^x  is  also 
analytic  in  CL,  we  see  that  ftjf)(0e2'^  is  analytic  in  C_.  Similarly,  ft^V^V”27^  is  analytic  in  C+. 

It  is  worthy  to  point  out  that  in  the  above  adjoint  squared  eigenfunctions  (91),  if  <p is 
replaced  by  the  Jost  function  ip^  (or  ip^)  or  if  ^  is  replaced  by  the  adjoint  Jost  function  ip^  (or 
4^),  the  resulting  functions  would  still  satisfy  the  adjoint  linearized  Sasa-Satsuma  equation.  Thus 
one  may  be  tempted  to  simply  take  (p>^=  ip^  (k=  1  or  2)  rather  than  (92)  as  adjoint 

squared  eigenfunctions.  The  problem  with  these  “simpler”  adjoint  squared  eigenfunctions  is  that 
their  inner  products  with  squared  eigenfunctions  (69)  are  not  what  they  are  supposed  to  be  [see 
(102 Ml 04)],  neither  does  one  get  the  closure  relation  (98).  Thus  the  adjoint  squared  eigenfunc¬ 
tions  (91)  coming  from  our  systematic  procedure  of  variation  calculations  are  the  correct  ones  to 
use. 
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C.  Extension  to  the  general  case 

In  the  previous  subsections,  the  squared  eigenfunctions  and  their  closure  relation  were  estab¬ 
lished  under  the  assumption  that  533  and  s33  have  no  zeros,  i.e.,  the  spectral  equation  (2)  has  no 
discrete  eigenvalues.  In  this  subsection,  we  extend  those  results  to  the  general  case  where  s33  and 
J33  have  zeros  in  their  respective  planes  of  analyticity  C±.  Due  to  the  symmetry  properties  (10)  and 
(12),  we  have 


*33(0  =  %(£*).  *33(£)  =  533(— D-  (112) 

Thus  if  e C_  is  a  zero  of  533,  i.e.,  533(^)=0,  then  — £*  eC_  is  also  a  zero  of  533,  and 
e  C+  are  both  zeros  of  s33.  This  means  that  zeros  of  533  and  s33  always  appear  in  quadruples. 
Suppose  all  the  zeros  of  533  and  s33  are  fy  e  C_  and  eC+,  y=  1 , ...  ,2 N,  respectively.  For  sim¬ 
plicity,  we  also  assume  that  all  zeros  are  simple. 

In  this  general  case  with  zeros,  squared  eigenfunctions  Z^,Z^  clearly  still  satisfy  the  linear¬ 
ized  Sasa-Satsuma  equation,  and  adjoint  squared  eigenfunctions  still  satisfy  the  adjoint 

linearized  Sasa-Satsuma  equation — facts  which  will  not  change  when  s33  and  s33  have  zeros.  The 
main  difference  from  the  previous  no-zero  case  is  that,  the  sets  of  (continuous)  squared  eigen¬ 
functions  {Z^(f;jc),Z^ (£;*), K}  and  adjoint  squared  eigenfunctions 
e  K}  are  no  longer  complete,  i.e.,  the  closure  relation  (98)  does  not  hold  any  more,  and  contribu¬ 
tions  from  the  discrete  spectrum  must  be  included  now.  To  account  for  discrete-spectrum  contri¬ 
butions,  one  could  proceed  by  adding  variations  in  the  discrete  scattering  data  in  the  above 
derivations.  But  a  much  easier  way  is  to  simply  pick  up  the  pole  contributions  to  the  integrals  in 
the  closure  relation  (98),  as  we  will  do  below.  Recall  from  the  previous  subsections  that  Z*,;) 
X(£;xj)e~2,£x  and  fl^(f ;xj)e2,^K  are  analytic  in  C_,  while  Z^(f;jcff)e21**  and  il^{(,\xj)e~2^x 
are  analytic  in  C+.  In  addition,  from  the  large-f  asymptotics  (31),  (32),  and  (35)  and  symmetry 
relations  (46),  we  easily  find  that 

z\l)(C\x,t)-^e2llx*Si(i'( 0,l)r.  ->  e~2!lx-8il\ l,0)r,  (->»,  (1 13) 


Thus, 


n  —  -  e-2i{x-$i{i'(0, 1  )T,  0%\c,x.t) 


-e2ilx+ii(3,(\,0)T,  £-»*. 


(114) 


f  -^^\C\x,t)n{;)TU;x'.t)dC=-  \  e^-^dC diag(0,l),  (115) 


where  the  integration  path  C~  is  the  lower  semicircle  of  infinite  radius  in  anticlosewise  direction. 
Since  the  function  e2l^x~x  *  in  the  right  hand  side  of  the  above  equation  is  analytic,  its  integration 
path  can  be  brought  up  to  the  real  axis  R.  In  the  sense  of  generalized  functions,  that  integral  is 
equal  to  nSix-x');  thus 


"TT T Z(1/)(fu,r)0(I/)r(f;.r/,r)^=-  ttS(x-  *')diag(0, 1).  (116) 

J(r  533lb) 

Doing  the  same  calculation  for  the  integral  of  along  the  upper  semicircle  of  infinite 

radius  C*  in  closewise  direction  and  combining  it  with  the  above  equation,  we  get 


-  -  f  -^—^l\C,xm^TU\x\t)dC-  -  \  ^—2^\c-,xd)^T(C;x',t)dC=  S(x-x')l. 
77 J<r  533(^)  77 Jc*  •y33(f) 


(117) 


Now  we  evaluate  the  two  integrals  in  the  above  equation  by  bringing  down  the  integration  paths 
to  the  real  axis  and  picking  up  pole  contributions  by  the  residue  theorem.  Recalling  that  zeros  of 
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s33  and  S33  are  simple,  the  poles  in  the  above  integrals  are  second  order.  After  simple  calculations, 
we  get  the  closure  relation  for  the  general  case  (with  discrete-spectrum  contributions)  as 


-ir 

7TJ-  a 


4»(f) 


2  N 


^-2 


2/ 


2  N 


2  i 


_ 

X((,;xj)0<')T^  ~  _2/r, 

"b  J  jm  1  S&Xij) 

+  —^Uj;x,t)^)T(ifx',t)  =S(x-x')L 
"fe 


J=l  s33 (£j)  - 


4%;x,t)0¥TUj;x',t) 

(118) 


where 


d£l{P  _ 


Mil 


n</> 


(119) 


e^fyjjc.r)  =  ^Uy,x,t)  -  zr^-OPUfxj) 


H 


(Cj) 


(120) 


12 

which  have  been  termed  “derivative  states”  4  due  to  the  differentiation  with  respect  to  f.  Clearly, 
Z^(£/),  ^ISCUjl  and  dz!f  I d£(£j)  satisfy  the  linearized  Sasa-Satsuma  equation  and  are 

thus  discrete  squared  eigenfunctions,  while  Cl^((j)y  and  sat*s^Y  the  ad¬ 

joint  linearized  Sasa-Satsuma  equation  and  are  thus  discrete  adjoint  squared  eigenfunctions. 

In  soliton  perturbation  theories,  explicit  expressions  for  squared  eigenfunctions  under  soliton 
potentials  are  needed.  For  such  potentials,  53i=532=513=523=0;  thus  G=1  in  the  Riemann-Hilbert 
problem  (33).  Solutions  of  this  Riemann-Hilbert  problem  have  been  solved  completely  for  any 
number  of  zeros  and  any  orders  of  their  algebraic  and  geometric  multiplicities.2  In  the  simplest 
case  where  all  zeros  of  the  solitons  are  simple,  solutions  P±  can  be  found  in  Eq.  76  of  Ref.  25 
(setting  =  r^  =  1).  One  only  needs  to  keep  in  mind  here  that  the  zeros  of  s33  and  s33  always 
appear  in  quadruples  due  to  the  symmetries  (112).  Once  solutions  P±  are  obtained,  the  solitons 
w(jc,f)  follow  from  Eq.  (37).  Taking  the  large-*  asymptotics  of  P±  and  utilizing  the  symmetry 
conditions  (10),  the  full  scattering  matrix  S  can  be  readily  obtained.  Then  together  with  P± , 
explicit  expressions  for  all  the  Jost  functions  d>,  'F  as  well  as  squared  eigenfunctions  are  then 
derived. 


IV.  SUMMARY  AND  DISCUSSION 

In  this  paper,  squared  eigenfunctions  were  derived  for  the  Sasa-Satsuma  equation.  It  was 
shown  that  these  squared  eigenfunctions  are  sums  of  two  terms,  where  each  term  is  a  product  of 
a  Jost  function  and  an  adjoint  Jost  function.  The  procedure  of  this  derivation  consists  of  two  steps: 
One  is  to  calculate  the  variations  of  the  potentials  via  variations  of  the  scattering  data  by  the 
Riemann-Hilbert  method.  The  other  one  is  to  calculate  variations  of  the  scattering  data  via  varia¬ 
tions  of  the  potentials  through  elementary  calculations.  It  was  proved  that  the  functions  appearing 
in  these  variation  relations  are  precisely  the  squared  eigenfunctions  and  adjoint  squared  eigen¬ 
functions  satisfying,  respectively,  the  linearized  equation  and  the  adjoint  linearized  equation  of  the 
Sasa-Satsuma  system.  More  importantly,  our  proof  is  quite  general  and  also  holds  for  other 
integrable  equations.  Since  the  spectral  operator  of  the  Sasa-Satsuma  equation  is  3  X  3  while  its 
linearized  operator  is  2  X  2,  we  demonstrated  how  the  two-component  squared  eigenfunctions  are 
built  from  only  selected  components  of  the  three-component  Jost  functions,  and  we  have  seen  that 
symmetry  properties  of  Jost  functions  and  the  scattering  data  play  an  important  role  here. 

The  derivation  used  in  this  paper  for  squared  eigenfunctions  is  a  universal  technique.  Our 
main  contributions  to  this  method  are  triplefold.  First,  we  showed  that  for  a  general  integrable 
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system,  if  one  can  obtain  the  variational  relations  between  the  potentials  and  the  scattering  data  in 
terms  of  the  components  of  the  Jost  functions  and  their  adjoints,  then  from  these  variational 
relations  one  can  immediately  identify  the  squared  eigenfunctions  and  adjoint  squared  eigenfunc¬ 
tions  which  satisfy,  respectively,  the  linearized  integrable  equation  and  its  adjoint  equation.  Sec¬ 
ond,  we  showed  that  from  these  variational  relations,  one  can  read  off  the  values  of  nonzero  inner 
products  between  squared  eigenfunctions  and  adjoint  squared  eigenfunctions.  Thus  no  longer  is  it 
necessary  to  calculate  these  inner  products  from  the  Wronskian  relations  and  the  asymptotics  of 
the  Jost  functions.  Third,  we  showed  that  from  these  variational  relations,  one  can  immediately 
obtain  the  closure  relation  of  squared  eigenfunctions  and  adjoint  squared  eigenfunctions.  After 
squared  eigenfunctions  of  an  integrable  equation  are  obtained,  one  then  can  readily  derive  the 
recursion  operator  for  that  integrable  equation,  since  the  squared  eigenfunctions  are  eigenfunctions 
of  the  recursion  operator. 

An  important  remark  about  this  derivation  of  squared  eigenfunctions  is  that  it  uses  primarily 
the  spectral  operator  of  the  Lax  pair.  Indeed,  the  key  variation  relations  (65),  (89),  and  (90)  were 
obtained  exclusively  from  the  spectral  operator  (2),  while  the  time-dependent  versions  of  these 
relations  (74),  (95),  and  (96)  were  obtained  by  using  the  asymptotic  form  of  the  time  evolution 
equation  in  the  Lax  pair  as  \x\— ►«:.  This  means  that  all  integrable  equations  with  the  same  spectral 
operator  would  share  the  same  squared  eigenfunctions  (except  an  exponential-in-time  factor  which 
is  equation  dependent).  For  instance,  a  whole  hierarchy  of  integrable  equations  would  possess  the 
“same”  squared  eigenfunctions,  since  the  spectral  operators  of  a  hierarchy  are  the  same.  This 
readily  reproduces  earlier  results  in  Refs.  10,  11,  and  19  where  such  results  were  obtained  by  the 
commutability  relations  between  the  linearized  operators  and  the  recursion  operator  of  a  hierarchy. 

Since  the  derivation  in  this  paper  uses  primarily  the  spectral  operator  of  the  Lax  pair,  if  two 
integrable  equations  have  similar  spectral  operators,  derivations  of  their  squared  eigenfunctions 
would  be  similar.  For  instance,  the  spectral  operator  of  the  Manakov  equations  is  very  similar  to 
that  of  the  Sasa-Satsuma  equation  [except  that  the  potential  term  Q  in  the  Sasa-Satsuma  spectral 
operator  (20)  possesses  one  more  symmetry].  Making  very  minor  modifications  to  the  calculations 
for  the  Sasa-Satsuma  equation  in  the  text,  we  can  obtain  squared  eigenfunctions  for  the  Manakov 
equations.  This  will  be  demonstrated  in  the  Appendix. 

With  the  squared  eigenfunctions  obtained  for  the  Sasa-Satsuma  equation,  a  perturbation 
theory  for  Sasa-Satsuma  solitons  can  now  be  developed.  This  problem  lies  outside  the  scope  of 
the  present  article  and  will  be  left  for  future  studies. 
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APPENDIX:  SQUARED  EIGENFUNCTIONS  FOR  THE  MANAKOV  EQUATION 

The  Manakov  equations  are 


iut  +  uxx  +  2(|//|2  +  \  v\  2)u  =  0,  (A  1 ) 

iut  +  vxx  +  2(|w|2  +  H2)i?  =  0.  (A2) 

The  spectral  operator  for  the  Manakov  equations  is  Eq.  (2)  with 
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(A3) 


Below  we  will  use  the  same  notations  on  Jost  functions  and  the  scattering  matrix  as  in  the  main 
text.  Since  Q  above  is  also  anti-Hermitian,  Jost  functions  and  the  scattering  matrix  satisfy  the 
involution  properties  (9)  and  (10)  as  well.  The  main  difference  between  the  Sasa-Satsuma  equa¬ 
tion  and  the  Manakov  equations  is  that  the  potential  Q  above  for  the  Manakov  equations  does  not 
possess  the  additional  symmetry  (4);  thus  the  Jost  functions  and  the  scattering  matrix  do  not 
possess  the  symmetries  (11)  and  (12).  In  addition,  the  asymptotics  (55)  also  needs  minor  modifi¬ 
cation.  It  is  noted  that  the  spectral  operator  of  the  Manakov  equations  is  3  X  3,  while  the  linearized 
operator  of  these  equations  is  4  X  4.  So  one  needs  to  build  four-component  squared  eigenfunctions 
from  three -component  Jost  functions.  This  contrasts  the  Sasa-Satsuma  equation  where  one  builds 
two-component  squared  eigenfunctions  from  three-component  Jost  functions. 

Repeating  the  same  calculations  as  in  the  main  text  with  the  above  minor  modifications  kept 
in  mind,  we  still  get  Eq.  (57)  for  the  variations  (Su,8u*),  while  the  expressions  for  variations 
(SV'Sv*)  are  Eq.  (57)  with  IIn  and  1I31  replaced  by  n23  and  n32.  The  expression  for  matrix  U  is 
still  given  by  Eq.  (60).  Thus  variations  of  the  potentials  via  variations  of  the  scattering  data  are 
found  to  be 


1  P 

[Sii,8v,Su\Sv’]t=  -  -  [Z, (£;*)<%>,(£)  +  Z2(£u)<$p2(£)  +  Z-,(f;x)<5p,(£)  +  Z4(£;jr)<5p2(f)]d£, 

irj_x 


(A4) 


where 


Z.= 


031013 

031023 

011033 

021033 

032013 

.  Z2  = 

032023 

.  z3  = 

012033 

,  z4  = 

022033 

03301  j 

033021 

013031 

023031 

_  03^012  _ 

_  033022  _ 

_  013  032  _ 

_  023032  _ 

(A5) 


Variations  of  the  scattering  matrix  SS  are  still  given  by  Eq.  (83),  with  SQ  modified  in  view  of  the 
form  of  (A3).  Then  defining  vectors 


<Pl  =  (<Plj)  =  *2201  ”*2102.  <f2  =  (<P2j)  “  -*120|  +  *||02. 

01  =  (<Plj)  =  *2201  -  *1202.  02  =  (02 j)  =  ~  *2101  +  *1 1  02- 
we  find  that  variations  of  the  scattering  data  are 


(A6) 

(A7) 


<wa= 


*3l(£) 


<5 P,(0  = 


&(£)’ 


Su(x) 

\  / 

Su(x) 

Sv(x) 

&>(*) 

Su’(x) 

,  (  n2(f;x), 

/  533(f)  \ 

Su*(x) 

>*U)J 

/  \ 

_8v*(x) 

Mx) 

\  / 

Su(x) 

Sv(x) 

\  /x  1  /  / 

Su(x) 

Su'(x) 

Su*(x) 

_&>*(*)  J 

1  \ 

_<5i?*(jc) 

(A8) 


(A9) 


where 


Downloaded  25  Feb  2009  to  132.170.163.254  Redistribution  subject  to  AIP  license  or  copyright;  see  http://jmp.aip.org/jmp/copyright  jsp 


023504-21 


Squared  eigenfunctions  for  Sasa-Satsuma  equation 


J.  Math.  Phys.  50,  023504  (2009) 


^1^13 

<fol<P23 

“  ^21^33 

il,  = 

^32^13 

,  n2  = 

^32^23 

,  n3  = 

“  Vl2^3 

.  n4= 

”  ^22^33 

“  ^33^11 

”  033^21 

<Pl3<A3l 

^23^31 

^33^12  _ 

^33<P22  _ 

Vl3*fc2 

<£23^3  2 

(A10) 


For  Manakov  equations,  the  time  evolution  operator  of  the  Lax  pair  is 

Y,=  -2iC2AY+V(C,u)Y,  (All) 

where  V  goes  to  zero  as  Jt  approaches  infinity.  Introducing  time-dependent  Jost  functions  and 
adjoint  Jost  functions  such  as 


<t>('>  -  d*-2*2^  cph)  s  e2  iV2Ar<j) 


(A  12) 


and  use  them  to  replace  the  original  Jost  functions  and  adjoint  Jost  functions  in  the  Zk  and  ilk 
definitions  (A5)  and  (A  10),  the  resulting  functions  3  4  and  3  4  would  be  squared 
eigenfunctions  and  adjoint  squared  eigenfunctions  of  the  Manakov  equations.  Equi¬ 
valently,  {Z^e*4^  1 1  ,Z4e*li  l}  are  squared  eigenfunctions  and 
{Oie4^  adjoint  squared  eigenfunctions  of  the  Manakov  equations. 

It  is  noted  that  squared  eigenfunctions  for  the  Manakov  equations  have  been  given  in  Ref.  18 
under  different  notations  and  derivations.  Our  expressions  above  are  more  explicit  and  easier  to 
use. 
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Selected  Problems  in  Astrophysics  and  Cosmology 


RESONANCES  IN  HIGH  DENSITY 
NONNEUTRAL  PLASMAS 

D.  J.  Kaup 

Institute  for  Simulation  and  Training  &  Department  of  Mathematics, 
University  of  Central  Florida,  Orlando,  FL  32816 
Email:  kaup@ucf.edu;  Tel/  407-823-2795;  Fax:  407-823-6253 

Theoretical  studies  of  operation  of  crossed-field  electron  vacuum  devices,  such 
as  magnetrons  and  crossed-field  amplifiers  (CFA).  have  usually  centered  on  their 
initial  growth,  taking  this  as  an  indication  of  their  operating  modes.  In  such  an 
analysis,  one  assumes  a  growth  rate,  and  solves  the  equations  for  the  density  pro¬ 
file,  the  operating  frequency,  the  growth  rate,  and  other  features  of  these  devices. 
What  one  obtains  here  are  the  conditions  for  the  initial  operation  of  the  device. 
The  dominate  interaction  in  this  stage  is  a  Rayleigh-like  instability  which  turns 
into  a  nonlinear  diffusion  process,  whereby  the  electron  density  profile  redistributes 
itself  into  a  profile  which  will  be  in  an  equilibrium  with  the  ponderomotive-like 
forces  produced  by  the  growing  RF  fields.  Eventually  the  RF  fields  will  saturate 
and  the  device  will  then  settle  into  either  a  stationary  operating  regime  or  some 
other  regime  which  could  be  non-operating.  A  key  question  is  to  understand  this 
latter  stage  of  a  device’s  operation,  which  has  been  called  the  ^saturation  stage*. 
In  this  saturation  stage,  wherein  one  would  like  the  device  to  simply  smoothly  de¬ 
liver  RF’  power,  there  is  a  different  set  of  physical  interactions  occuring.  No  longer 
is  there  a  growth  rate  -  rather  the  amplitudes  have  saturated,  and  as  a  result,  the 
ponderomotive-like  forces  have  also  vanished  along  with  the  nonlinear  diffusion. 
Actually  there  is  no  further  need  for  a  nonlinear  diffusion,  since  the  profile  has 
already  been  reshaped  in  the  initiation  stage.  To  study  this  saturation  stage  it  is 
necessary  to  return  to  the  classical  magnetron  equations  and  re-study  the  nature 
of  the  solutions  in  a  saturation  state.  What  we  find  is  that  upon  saturation,  new 
RF  modes  appear.  In  fact,  there  are  a  total  of  five  RF’  modes  in  the  saturation 
stage,  three  of  which  have  very  fast  oscillations  in  the  vertical  direction  One  mode 
corresponds  to  a  plasma  oscillation  in  the  vertical  direction,  which  is  driven  by  the 
vertical  current  through  the  device.  The  other  two  modes  are  cyclotron  modes. 
When  operating  appropriately,  the  plasma  oscillation  has  a  resonance  with  the 
potential  flow  created  in  the  initiation  stage,  and  appears  to  be  the  dominant 
interaction  in  transferring  power  from  the  DC  field,  to  the  RF’  fields.  The  fast 
cyclotron  modes  can  interact  with  the  potential  flow  only  under  special  circum¬ 
stances,  one  of  which  occurs  when  the  CPI  T266  operates  in  an  ^ultra-low  noise* 
mode.  What  we  will  do  here  is  to  present  the  appropriate  magnetron  equations 
for  the  saturation  stage,  and  use  the  WKB  approximation  to  discuss  and  describe 
the  nature  of  these  solutions. 
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1.  Introduction 

Electronic  vacuum  devices  such  as  magnetrons  and  crossed-field  am¬ 
plifiers  (CFA)  are  important  generators  of  high  power  microwaves.  They 
are  also  relatively  simple  in  their  structure  and  operation.  They  consist  of 
a  vacuum  region,  inside  of  which,  a  high  density  electron  plasma  is  created. 
The  theory  of  their  general  operating  range  follows  straight- forwardly  from 
electrostatics  and  magnetostatics  [1],  However  developing  a  theory  for  the 
RF  oscillations  has  not  been  so  direct.  The  genesis  of  the  RF  theory  is 
the  Buneman,  Levy,  Lindson  (BLL)  theory  of  1966  [2].  Here,  by  assuming 
a  Brillouin  flow,  they  were  able  to  calculate  RF  operating  characteristics. 
However,  their  theory  was  not  successful  since  it  was  based  on  what  has 
been  called  the  magnetron  resonances,  which  occur  when  the  local  RF  fre¬ 
quency,  as  seen  by  the  electrons,  uje  =  uj  —  kvo ,  is  equal  in  magnitude  to  the 
cyclotron  frequency,  H.  (w  is  the  RF  frequency,  k  is  the  RF  wavevector,  and 
t’o  is  the  E  x  B  drift  velocity.)  The  diocotron  resonance  (the  wave-particle 
resonance  at  u;e  =  0)  did  not  appear  in  their  theory.  The  reason  for  this 
was  found  later  [3],  when  it  was  discovered  that  the  diocotron  resonance 
did  occur  «inside»  the  assumed  discontinuity  at  the  edge  of  the  Brillouin 
flow.  This  became  apparent  when  the  assumed  discontinuity  at  the  edge  of 
the  Brillouin  flow  was  replaced  by  a  steep  ramp,  inside  of  which  was  found 
the  diocotron  resonance  and  its  structure  [3].  It  was  found  that  due  to  the 
shear  flow,  a  Raleigh  instability  would  occur  inside  the  steep  ramp  at  the 
edge  of  the  Brillouin  flow,  which  then  excited  a  nonlinear  diffusion  process 
which  redistributed  the  electrons  into  a  new  stationary  density  profile  |4], 
but  one  which  would  be  in  equilibrium  with  the  ponderomotive-like  forces 
of  the  propagating  RF  wave. 

However,  the  RF  analytical  theory  has  only  been  developed  for  the 
initiation  stage,  which  is  where  the  device  is  initializing,  and  the  internal 
RF  wave  is  still  growing.  In  this  paper,  an  extension  is  presented  of  the  RF 
theory  to  the  stage  that  follows  the  initiation  stage,  called  the  ^saturation 
stage*.  The  saturation  stage  is  where  the  RF  amplitudes  have  grown  to 
their  maximum  size  and  have  saturated,  and  this  is  where  the  device  will 
steadily  deliver  RF  power.  To  obtain  this  extension,  we  return  to  the  orginal 
derivation  [5]  of  the  RF  theory  for  the  initiation  stage.  Comparison  of  the 
properties  of  the  two  stages,  it  is  seen  that  there  are  two  items  that  are 
sufficiently  different  in  these  two  stages  to  affect  the  derivation.  The  first 
item  is  that  by  the  end  of  the  initiation  stage,  a  DC  current  from  the  anode 
to  the  cathode  has  been  developed,  and  along  with  it,  an  corresponding 
component  to  the  DC  electron  E  x  B  drift  velocity  in  the  direction  from 
the  cathode  to  the  anode.  The  second  item  is  that  in  the  saturation  stage, 
since  we  must  have  a  steady  state,  there  can  be  no  growth  rate  in  the  RF 
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field  [6].  In  the  initiation  stage,  a  nonzero  growth  rate  was  1)  necessary  for 
the  existence  of  the  nonlinear  diffusion,  and  2)  necessary  for  the  existence 
of  nonsingular  solutions  of  the  RF  equations.  Thus  in  the  saturation  stage, 
the  nature  of  the  RF  solutions  will  be  significantly  different.  The  purpose 
of  this  paper  is  to  present  the  RF  solution  in  the  saturation  stage. 

In  Section  2,  we  will  discuss  the  geometry  used  and  derive  the  RF  equa* 
tions  for  the  saturation  stage,  from  the  basic  cold-fluid,  Poisson  equations. 
We  will  assume  planar  geometry  for  simplicity.  What  we  will  find  here  is 
that  due  to  the  nonzero  but  small  value  of  the  component  of  the  electron 
drift  velocity  from  the  cathode  to  the  anode,  the  saturation  RF  equations 
become  a  fifth-order  set  of  ordinary  differential  equations  (ODEs),  wdiereas 
in  the  initiation  stage,  one  only  had  a  second-order  set  of  ODEs.  The  solu¬ 
tion  of  these  ODEs  will  clearly  be  nonsingular,  in  contrast  to  the  initiation 
RF  equations,  which  would  be  singular  if  the  growth  rate  were  to  vanish. 
In  Section  3  we  shall  discuss  the  structure  of  the  solution  of  the  RF  satu¬ 
ration  equations,  and  construct  approximate  solutions.  What  we  find  here 
the  well-known  situation  when  a  small  parameter  [7]  multiplies  the  highest 
derivative(s)  in  a  set  of  ODEs:  one  has  either  a  boundary  layer  or  a  reso¬ 
nance  layer.  The  solution  of  this  problem  then  requires  a  multi-scaling  of  the 
boundary  layer  and/or  resonance  layer.  Here  the  fifth-order  system  of  ODEs 
for  the  RF  fields  will  be  studied  in  the  WKB  limit.  We  wall  see  that  three  of 
these  modes  are  «fast»,  and  do  not  occur  in  the  initiation  RF  equations.  The 
other  two  modes  are  «slow»  modes,  and  are  the  usual  two  modes  that  are 
found  in  the  initiation  stage.  We  will  summarize  our  results  in  Section  4. 

2.  Basic  equations 

The  geometry  and  configuration  that  we  shall  be  using  is  shown  in 
Fig.  1,  which  is  a  diagram  of  a  «planar  magnetron*.  At  the  bottom  is  the 
cathode,  which  is  an  electron  emitting  material.  It  is  located  in  the  xz- 
plane  and  at  y  =  0.  We  take  the  ambient  magnetic  field  to  be  directed  along 
the  negative  2-axis,  so  that  the  electron  drift  velocity  will  be  mainly  in 
the  positive  x-direction.  (In  the  saturation  stage,  this  DC  drift  current  will 
have  a  small  vertical  component  in  the  ^-direction,  as  shown.)  The  electron 
cyclotron  frequency  is  Q  =  eBo/(mc)  where  Bo  is  the  ambient  magnetic 
field.  Next  to  the  cathode  is  the  electron  sheath  which  extends  out  for  some 
distance  from  the  cathode,  as  shown.  The  anode  is  located  at  a  distance  £ 
above  the  cathode.  On  the  anode  is  a  slow  wave  structure  which  is  simply 
a  wave  guide,  along  which  the  RF  wave  wall  propagate.  This  slow  wave 
structure  is  there  to  i)  slow  down  the  electromagnetic  RF  wrave  and  ii)  to 
select  the  effective  RF  (electrostatic)  wave  vector,  k ,  that  the  plasma  will 
see  and  will  respond  to. 
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Figure  1.  The  geometry  and  configuration  of  a  planar  magnetron  or  CFA.  The 
magnetic  field  is  directed  into  the  paper  and  the  electron  drift  velocity  is  to  the 
right  The  2-axis  is  perpendicular  to  the  paper,  the  x-axis  is  parallel  to  the  cathode 
surface  and  the  y- axis  is  perpendicular  to  the  cathode. 

During  the  initiation  stage,  the  diocotron  interaction  causes  the  sheath 
to  readjust  its  profile,  from  a  pure  Brillouin  flow  to  one  that  is  in  equilibrium 
with  the  growing  RF  wave.  The  wavelength  of  the  RF  wave  will  determine 
the  periodicity  of  the  spoke-like  structure  that  evolves  from  the  sheath  due 
to  the  nonlinear  diffusion,  which  extends  from  the  surface  of  the  sheath  out 
to  the  anode.  A  typical  plot  of  this  structure  is  seen  in  Pig.  2,  which  is  a 
plot  of  the  electron  density  vs.  x  and  y.  At  the  back  is  the  Brillouin-like 
sheath  while  in  the  front  one  can  see  the  spoke  of  charge  which  carries  the 
t/-component  of  the  DC  current  to  the  anode.  Note  that  the  structure  shown 
contains  only  two  Pourier  components  in  the  x  direction:  a  DC  background 
and  a  single  value  of  k  in  elkx ,  representing  the  RF  component  [8]. 

The  basic  cold  fluid  equations  describing  this  physical  system  are: 


dtn+  V  •( v  n)  =0, 

(1) 

dt  7  +(v  ■  V)  v  -  V  0+  v  x  $2=0 , 

(2) 

V20  =  71  . 

(3) 

In  the  above  equations,  the  fluid  velocity  is  in  the  xy- plane,  and  all  quantities 
are  taken  to  be  independent  of  the  2  coordinate.  We  have  also  simplified  the 
notation  by  taking  n  to  be  the  electron  plasma  frequency  squared  (4ne2p/m 
where  e(m)  is  the  electronic  charge  (mass)  and  p  is  the  electron  number 
density)  and  (j>  to  be  {e/m)  times  the  electrostatic  potential. 
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Figure  2.  The  electron  density  for  one  period  of  a  fully  developed  spoke  structure 
by  the  initiation  stage  on  the  sheath  The  anode  is  in  the  foreground  and  the 
cathode  toward  the  back.  Taken  from  Fig.  5  of  Ref  [8|. 

Let  us  now  proceed  to  expand  these  equations  in  the  following  manner. 
Let  us  take 


and  similar  for  the  other  quantities.  In  the  above,  the  subscript  «0»  will 
refer  to  a  DC  quantity  while  the  subscript  «1»  will  refer  to  a  RF  quantity. 
Also,  k  is  the  wave  vector  for  the  RF  wave  and  u  is  the  RF  frequency,  both  of 
which  are  real.  Note  that  we  do  not  include  any  «second-order  DC  terms*, 
as  has  been  done  for  the  initiation  stage  [5].  The  reason  for  this  is  that 
by  the  end  on  the  initiation  stage,  any  such  corrections  would  typically  no 
longer  be  small,  and  therefore  should  be  combined  with  the  lower  order  term 
as  a  general  DC  term.  Thus  we  will  only  have  the  DC  and  the  RF  Fourier 
modes.  As  a  consequence,  the  DC  equations  will  contain  second  order  terms 
which  arise  from  the  averaging  of  the  RF  fields,  in  addition  to  the  usual 
DC  quantities  and  terms.  Also,  unlike  in  the  initiation  stage,  the  DC  drift 
velocity  will  have  both  an  x-  and  y-component,  as: 


V=  v0x{y)x  +  v0y(y)y , 


where  the  y-component  will  arise  from  the  ponderomotive-like  forces  in  the 
DC  equations. 

Using  the  expansion  form  in  (4),  upon  expanding  (1)  -  (3),  one  obtains 
for  the  DC  components  of  these  equations: 
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-^{novOy  +  n1vly  +  niViy)  =  0,  (6) 


Note  the  quadratic  RF,  poneromotive-like  terms  present.  Meanwhile,  from 
the  RF  components  of  the  same  equation,  which  are  the  linear  perturbations 
of  (l)-(3),  we  obtain: 


—  (n\Voy  4-  noviy)  4-  ikn 0vix  -  iujen i  =  0, 

dv  Ox  dx)\x  A 

Vly~dy~  +  V°v~dy - ^ Vly  ~  tu)eVi:r  ~  tk^  = 

-^{VQyVly)  -  iu>eV\y  +  ^  =  0, 


—  k2<t>  i  —  ni 


=  0, 


(10) 

(11) 

(12) 

(13) 


where 


ujg  —  uj  —  kvQx  . 


(14) 


Note  that  when  voy  is  zero,  Eqs.  (10)  -  (13)  become  the  same  RF  equations 
as  those  used  for  the  initiation  stage  [5].  In  this  case,  the  vanishing  of  t?oy 
causes  the  derivatives  of  m,  v\x  and  tqy  to  vanish  from  these  equations.  One 
thus  is  left  with  a  second-order  set  of  ODEs  and  three  algebraic  relations. 
However, when  voy  is  nonzero,  one  then  has  all  five  derivatives  of  the  RF 
quantities  present,  and  furthermore,  Eqs.  (10)  -  (13)  can  be  solved  for  each 
derivative  of  the  RF  quantities.  Doing  so,  one  obtains 


dn\ 

dy 


^[(not,iv-n,VOy)  (^-^e) 


driQ 

-^■Vlyv0y- 


d<t>  i  , 

-  tiq— - h  novix 

dy 


(D  tfct>oy)J  i 


1 

v0y 


+  i^eVix  4-  ik<P\ 


(15) 

(16) 


dvu 

dy 
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Clearly,  this  is  a  fifth-order  set  of  nonsingular,  linear  ODEs  for  the  RF 
quantities. 

These  latter  equations  differ  from  the  RF  equations  of  the  initiation 
stage  in  two  aspects.  First,  in  the  initation  stage,  vqv  =  0  which  gave  us 
only  a  second-order  set  of  ODEs.  Second,  in  the  initiation  stage,  we  have 
a  nonzero  growth  rate,  in  which  case,  uje  has  a  nonzero  constant  imaginary 
part.  Consequently,  one  can  solve  (10)  for  m,  and  the  combination  of  these 
two  aspects  allows  one  to  obtain  nonsingular  solutions  to  the  RF  equations. 
In  the  saturation  stage,  the  growth  rate  must  vanish,  and  so  uje  must  then 
be  real,  vanishing  identically  at  the  diocotron  resonance,  which  would  give 
a  singularity  in  the  RF  initiation  ODEs.  However  if  we  take  t’oy  to  be 
nonzero,  this  is  of  no  consequence  since  we  then  have  a  fifth-order  set  of 
nonsingular  ODEs,  even  at  the  diocotron  resonance.  We  note  that  in  gen¬ 
eral,  from  Eq.  (6),  which  is  the  same  as  the  conservation  of  the  DC  current 
in  the  vertical  direction,  once  this  current  is  initiated  in  the  initiation  stage 
and  is  present  and  nonzero,  provided  the  fraction  of  the  RF  contribution  is 
not  too  large,  we  would  expect  uoy  to  always  be  nonzero. 

As  an  example  case,  we  show  in  Figs.  3  -  5  a  typical  theoretical  so¬ 
lution  for  the  CPI  (Communications  and  Power  Industries,  Inc.,  formally 
Varian  Beverly  Microwave  Division)  Tube  #T266,  which  is  a  erossed-field 
amplifier  (CFA)  operating  in  the  GHz  range.  These  figures  show  the  ex¬ 
pected  initiation  solution.  One  notes  in  Fig.  1  that  the  background  density 
consists  of  a  strong  Brilloin-like  hub  with  an  extended  foot,  on  which  the 
RF  density  variations  oscillate.  Fig.  4  shows  the  parallel  drift  velocity  while 
Fig.  5  shows  the  vertical  drift  velocity.  One  notes  that  the  vertical  drift 
velocity  is  rather  small,  typical  about  one-thousandth  of  the  parallel  drift 
velocity.  (The  spike  in  the  vertical  drift  velocity  is  due  to  the  cyclotron  res¬ 
onance  (oue  =  12  where  12  is  the  electron  cyclotron  frequency)  just  above  the 
cathode.  This  resonance  only  occurs  when  12  is  smaller  than  the  operating 
frequency,  oj.)  Except  where  the  cyclotron  resonance  occurs,  this  vertical 
drift  current  is  typically  almost  constant.  Here  the  important  point  is  that 
Vqv  is  significantly  smaller  than  the  other  parameters. 


Figure  3.  The  theoretical  background  density  for  the  T2T6  from  the  initiation 
equations.  The  units  of  no  are  (rad. /ns)2  and  of  y  are  m/100.  The  cathode  is  to 
the  left  and  the  anode  is  located  at  y  =  0.381. 


Figure  4.  The  theoretical  parallel  drift  velocity  for  the  T266  from  the  initiation 
equations.  The  units  for  Vqx  are  m/(100  ns). 


equations.  The  units  for  voy  are  m/(100  ns). 
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Due  to  the  smallness  of  t>2y,  one  finds  it  to  be  essentially  impossible  to 
directly  numerically  integrate  the  RF  equations  of  the  saturation  stage.  One 
can  see  that  since  three  of  the  derivatives  in  Eqs.  (15)  -  (18)  are  inversely 
proportional  to  at  least  t>oy,  it  follows  that  in  general  large  derivatives  will 
occur,  as  can  be  seen  in  Figs.  6  and  7,  which  shows  a  typical  example  of 
attempting  to  integrate  these  equations  directly.  As  one  can  see,  the  density 
oscillations  dominate  while  the  velocity  oscillations  are  an  order  smaller. 
(The  potential  oscillations  are  another  order  smaller  still.) 

Considering  that  these  fast  oscillations  are  present,  it  therefore  becomes 
appropriate  to  use  WKB  methods  to  separate  out  the  fast  variations  from 
the  slow  ones.  In  the  next  section,  we  shall  use  a  WKB  expansion  to  isolate 
the  three  fast  modes  from  the  two  slow  modes  and  discuss  the  nature  of  the 
solution  represented  by  each  mode. 


Figure  6.  The  magnitude  of  the  RF  Ni  as  a  function  of  y  from  a  direct  numerical 
solution  of  Eqs  (15)-(18).  for  the  background  given  in  Figs.  3-5. 
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Figure  7.  The  magnitude  of  the  RF  vix  as  a  function  of  y  from  a  direct  numerical 
solution  of  Eqs.  (15)-(18),  for  the  bac  kground  given  in  Figs.  3-5.  The  insert  shows 
that  the  solution  is  actually  comprised  of  more  than  one  fast  mode,  three  in  fact. 
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3.  Fast  and  Slow  RF  Modes 

What  we  will  do  here  is  to  start  with  a  given  DC  background,  such 
as  that  in  Figs.  3-5  and  proceed  to  solve  the  fifth-order  set  of  the  RF 
saturation  equations,  (10)- (13)  by  the  WKB  method,  using  the  smallness 
of  voy  as  an  expansion  parameter.  We  take  t’oy  to  be  of  order  e  and  assume 
a  fast  dependence  of  the  phase  of  the  RF  quantities  on  the  coordinate  y. 
Thus  we  take 

n,  =  Anel6^  ,  vix  =  Axei$/t,  vly  =  Ayei6'<,  <p  =  A*ei6/c ,  (19) 


where 


(20) 


with  a  of  order  unity.  Then  upon  replacing  vqv  with  ft»oy  in  Fqs.  (10)-(13) 
and  expanding  these  equations  in  powers  of  e,  we  obtain  a  set  of  homoge¬ 
neous  algebraic  equations.  Requiring  the  determinant  of  the  coefficients  to 
vanish  in  leading  order  of  e  gives  the  fifth-order  polynomial 


a2(av0y  -we)  [(awoj/  -  We)2  -  A2]  =0,  (21) 


where 

A  =  +  n0  -  12  ^  .  (22) 

The  two  a  =  0  modes  in  (21)  are  just  the  two  slow  modes  found  in  the  initia¬ 
tion  RF  equations.  The  other  three  modes  are  fast  modes.  Their  eigenvalues 
and  unnormalized  eigenvectors,  [An>  Ax,  v4y,  Ap],  are  given  by 

ai  =  (u;e  +  A )/v0y  ; 

[— rfno/k2,  — ia\(mega  —  v'0x)/k 2,  a\\ /k2,  rio/k 2]  ,  (23) 


<*2  =  (^e  -  A)/v0y  ; 

[~aln0/k2,  -ia2(D  —  v'0x)/k2,  -a2\/k2,  n0/k 2]  ,  (24) 


a3  =  u>e/vo y  ; 

[-a2S2(S2  -  v'0x)/k2,  ia3(ll  -  v'0x)/k2,  -«/*,  12(12  -  v'0x)/k2]  ,  (25) 

where  Vqx  is  the  derivative  of  vox  with  respect  to  y.  (In  the  DC  initiation 
equations,  v'0x  is  equal  exactly  to  n0/D,  in  which  case  A  would  exactly  equal 
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il.  However  from  the  DC  saturation  Eq.  (7),  we  see  that  in  general  this  need 
will  not  be  so  in  the  saturation  stage.) 

The  nature  of  these  eigenmodes  can  be  deduced  from  the  eigenvectors 
and  the  overall  phase.  For  the  first  two  eigenvectors,  since  the  q's  are  inverse 
proportional  to  Voy ,  and  as  we  have  already  seen  in  Figs.  6  and  7,  the 
dominate  component  is  the  density  oscillations,  with  the  velocities  being 
a  lower  order  and  consisting  of  a  right-  or  left-handed  elliptical  motion.  The 
orientation  of  the  elliptic  oscillation  depends  on  the  location.  In  the  sheath, 
since  no  there  is  essentially  H2,  the  motion  will  mainly  be  orientated  in 
the  ^-direction  when  vf0x  is  close  to  uq/Q.  Outside  the  sheath,  where  no  is 
significantly  less  than  D2,  the  motion  will  be  approximately  circular. 

Turning  next  to  the  overall  phase  of  these  modes, 


(26) 


we  see  that  if  we  follow  the  motion  of  the  guiding  center  of  a  fluid  particle, 
over  a  small  time  interval  of  time,  At ,  the  change  in  the  phase  is  given  by 


A  =kAx  —  ujAt  +  ctjAy  = 
=(kv0x  -  u>  4-  o^roy)^  = 


(27) 


=  ±  XAt , 


for  the  first  two  eigenmodes.  Thus  following  the  guiding  center,  any  small 
motion  of  the  fluid  particle  about  the  guiding  center  is  seen  to  be  oscillating 
at  a  modified  cyclotron  frequency,  A.  These  eigenmodes  are  therefore  the 
fluid  analogy  of  the  well-known  «Slater  orbits*  [1|.  On  the  other  hand,  if 
we  stay  fixed  in  x,  the  Eulerian  frame,  we  see  rapid  oscillations  in  yt  as  we 
have  already  seen  in  Figs.  6  and  7.  These  density  oscillations,  at  a  fixed  t 
would  be  almost  parallel  to  the  x-direction,  except  near  a  resonance,  where 
a  would  be  zero. 

For  the  third  eigenmode,  we  again  have  that  the  dominate  compo¬ 
nent  of  the  eigenvector  is  the  density  oscillations,  with  the  motion  mainly 
in  the  parallel  direction,  and  with  the  two  velocity  components  in  phase. 
Thus  there  is  no  cyclotron-like  motion  in  this  mode.  Rather,  the  motion  is 
a  back-and-forth  motion.  If  we  follow  the  motion  of  the  guiding  center  of 
a  fluid  particle  for  this  mode,  we  find  that  (27)  has  a  zero  change  along  the 
motion  of  the  guiding  center.  Thus  the  fluid  particles  have  a  zero  oscillation 
frequency  relative  to  the  guiding  center.  Thus  they  are  drifting.  Below  the 
diocotron  resonance  (ue  >  0),  if  the  fluid  particle  moves  in  the  positive 
x-direction  relative  to  the  guiding  center,  it  also  moves  slightly  downward 
in  the  y-direction,  and  the  opposite  occurs  if  it  is  above  the  diocotron  res¬ 
onance.  In  the  Eulerian  frame  with  x  fixed,  we  again  would  see  the  strong 


318 


density  oscillations  in  y,  and  at  a  fixed  t ,  they  would  also  be  almost  parallel 
to  the  x-direction,  except  near  the  diocotron  resonance. 

When  the  a’s  are  well  separated,  the  modes  are  noninteracting.  How¬ 
ever  when  they  cross,  or  are  equal,  they  interact,  and  can  transfer  energy 
from  one  mode  to  another.  For  the  two  zero  modes,  they  continually  inter¬ 
act,  and  satisfy  the  well  known  initiation  equations  [5],  except  at  the  dio¬ 
cotron  resonance,  where  these  equations  become  singular  when  the  growth 
rate  vanishes.  However,  when  they  do  become  singular,  at  the  same  time 
these  modes  will  be  interacting  with  the  third  mode.  It  is  this  interaction 
which  allows  a  nonsingular  solution  of  the  zero  modes  at  the  diocotron  reso¬ 
nance,  with  the  result  of  energy  being  transferred  from  one  inode  to  another. 
A  plot  of  these  a’s  vs.  y  for  the  background  given  in  Figs.  3  -  5  is  given  in 
Fig  8. 


Eigenvalues  vs.  y 


Figure  8.  The  five  modes  for  the  background  given  in  P  igs.  3-5.  Note  that  the 
three  fast  modes  never  cross  and  tend  to  have  a  fixed  separation,  except  near  the 
resonance  between  the  low  cyclotron  mode  and  the  slow  modes  near  the  cathode. 
The  main  resonance,  the  diocotron  resonance,  occurs  at  the  edge  of  the  sheath 
( y  7a  0.25)  and  corresponds  to  the  crossing  of  the  slow  modes  (a  =  0)  and  the 
drift  mode  (a  =  We./vov). 

Note  that  the  first  three  modes  will  never  interact,  since  their  a’s  could 
never  be  equal,  unless  A  would  happen  to  vanish.  This  is  not  expected  to 
ever  happen  since  generally  vf0x  would  be  near  the  value  of  no/H.  However, 
any  of  these  three  fast  modes  could  cross  the  zero  modes,  depending  on  the 
parameters,  in  which  case  they  would  interact.  For  an  operational  device,  the 
third  mode  must  cross  the  zero  modes;  this  is  the  interaction  which  allows 
the  device  to  generate  RF  power.  Thus  an  understanding  of  the  crossing 
of  the  third  mode  with  the  zero  modes  is  of  prime  importance.  Also  of 
importance  is  when  the  second  mode  crosses  the  zero  modes.  This  occurs 
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in  the  example  shown  in  Figs.  3-5,  and  appears  to  be  responsible  for  the 
ultra- low  noise  operation  of  this  device  [9,  10].  No  known  examples  of  when 
the  first  mode  would  cross  the  zero  modes  are  known,  nor  is  it  known  if 
this  interaction  would  be  of  value.  However,  if  it  did  occur,  it  would  have 
to  occur  between  the  diocotron  resonance  and  the  anode. 

4.  Conclusions 

We  have  shown  that  the  full  problem  of  the  theory  of  the  magnetron 
is  basically  a  fifth-order  system,  which  consists  of  two  slow  modes,  two  fast 
cyclotron  modes  (Slater  orbits),  and  one  fast  drift  mode  (the  diocotron 
mode).  We  have  also  detailed  the  nature  of  the  WKB  solutions  of  these 
modes,  and  have  indicated  what  resonances  between  these  modes  would 
be  important.  Of  course,  key  questions  remain  concerning  the  appropriate 
boundary  conditions  for  these  modes.  Future  work  will  be  concerned  with 
these,  as  well  as  with  further  details  of  the  nature  of  these  modes,  and  the 
calculation  of  the  conversion  rates  at  the  resonances. 
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Evolution  of  the  Scattering  Coefficients  of  the 
Camassa-Holm  Equation,  for  General  Initial  Data 


By  D.  J.  Kaup 


We  consider  the  Camassa-Holm  equation  for  general  initial  data,  particularly 
when  the  potential  in  the  scattering  problem  of  the  Lax  pair,  m  +  k,  becomes 
negative  over  a  finite  region.  We  show  that  the  direct  scattering  problem  of  the 
eigenvalue  problem  of  the  Lax  pair  for  this  equation  may  be  solved  by  dividing 
the  spatial  infinite  interval  into  a  union  of  separate  intervals.  Inside  each  of 
these  intervals,  the  initial  potential  is  uniformly  either  positive  or  negative. 
Due  to  this,  one  can  define  Jost  functions  inside  each  interval,  each  of  which 
will  have  a  uniform  asymptotic  form.  We  then  demonstrate  that  one  can  obtain 
the  /-evolution  of  the  scattering  coefficients  of  the  scattering  matrix  of  each 
interval.  In  the  process,  we  also  demonstrate  that  the  evolution  of  the  zeros  of 
m  +  k  can  be  given  entirely  in  terms  of  limits  of  the  scattering  coefficients  at 
singular  points. 


1.  Introduction 

The  Camassa-Holm  (CH)  equation  [1] 

mt  +  umx  +  2 ux(m  -+-  act)  =  0,  t  >  0,  x  €  /?,  (la) 

m  =  u  —  Uxx,  (lb) 
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describes  unidirectional  propagation  of  shallow  water  waves  on  a  flat  bottom 
where  u(x ,  /)  is  the  scaled  height  of  the  free  surface,  k(>0)  is  the  scaled 
critical  shallow  water  speed,  and  m(x,  /)  is  the  conjugate  momentum  of  u .  This 
equation  is  integrable,  and  has  the  Lax  pair  [1] 


where  A.  is  the  spectral  parameter  and  a  is  an  arbitrary  constant  (used  to  adjust 
the  oscillation  frequency  of  the  phase  of  the  Jost  functions).  The  integrability 
condition  for  (2)  is  (1),  with  A.  independent  of  time. 

Among  the  interesting  features  of  this  and  related  equations  is  the  existence 
of  peaked  soliton  solutions  [1-3]  which,  for  k  =  0,  although  the  solution  is 
continuous,  nevertheless  has  a  discontinuity  in  the  slope  at  the  peak  [1].  Due 
to  the  interesting  nature  of  these  solutions,  there  have  been  many  studies  of  the 
k  =  0  case  [4-8],  as  well  as  others,  which  include  applications  to  geometry  [9] 
and  studies  of  the  periodic  case  [1,  4,  6,  10].  Multisoliton  solutions  have  been 
given  a  parametric  representation  by  Matsuno  [11].  Interestingly,  this  equation 
has  also  served  as  a  counter-example  to  the  Painleve  test  for  integrability  [12]. 
More  recently,  Boyd  [13]  has  studied  the  transition  from  a  smooth  traveling 
wave  to  a  peaked  wave  (k  — >  0),  and  A.  Parker  [14]  has  shown  how  to  develop 
a  direct  method  of  solution.  Water  waves  are  not  the  only  application  of  this 
equation,  since  it  has  also  been  found  to  occur  in  elasticity  [15]  and  in 
non-Newtonian  fluids  [16]. 

As  one  can  see  from  (2a),  the  ^-component  of  this  Lax  pair  has  a  form 
where  the  inverse  of  the  spectral  parameter  multiplies  the  potential  term, 
m  +  k,  whereas  in  the  Schrodinger  equation,  the  spectral  parameter  and  the 
potential  are  additive.  This  gives  rise  to  a  different  set  of  analytical  and 
asymptotic  properties  for  the  eigenfunctions  of  this  spectral  problem  [17].  In 
particular,  whenever  the  potential  has  a  change  in  sign,  the  analytical  and 
asymptotic  properties  are  not  uniform  in  x .  The  direct  and  inverse  scattering 
problem  for  (2a),  when  the  initial  data  for  m  +  k  changes  sign,  has  not  been 
addressed.  However,  in  the  case  when  the  initial  data  for  m  +  k  never  changes 
sign,  both  the  direct  and  inverse  scattering  problem,  on  the  infinite  interval, 
had  been  earlier  addressed  [2],  and  further  described  in  a  series  of  recent 
papers  [18-20].  In  this  case,  eigenfunctions  can  always  be  chosen  so  that  the 
analytical  and  asymptotic  properties  will  be  uniform  in  x. 

An  important  feature  of  the  CH  equation  is  that  one  can  transform  from  an 
Eulerian  coordinate  to  a  Lagrangian  coordinate,  upon  which  (2a)  transforms 
into  the  Schrodinger  equation  [19].  Then  providing  m  +  k  remains  of  one 
sign,  one  can  proceed  as  in  the  solution  of  the  KdV  equation  [21].  However, 
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whenever  m  +  k  has  simple  zeros,  the  direct  and  inverse  scattering  problems  of 
(2a)  become  significantly  more  complex.  In  addition  to  singularities  appearing, 
there  is,  effectively,  a  rotation  of  nil  in  the  analytical  and  asymptotic  properties 
of  the  eigenfunction  of  (2a),  with  these  properties  not  being  uniform  in  x. 

Relevant  to  the  above,  it  has  been  pointed  out  by  McKean  [22,  23]  that 
whenever  the  initial  data  of  m  +  k  has  any  positive  region  to  the  left  of  any 
negative  region,  then  the  solution  of  (1)  will  break  in  a  finite  time.  Thus  we 
can  anticipate  that  when  the  initial  data  do  contain  a  negative  region,  then 
the  solution  of  the  more  general  initial  value  problem  would  contain  some 
new  and  interesting  features.  This  same  class  of  spectral  problems  (where  the 
potential  multiplies  the  spectral  parameter)  has  been  around  for  some  time, 
since  it  also  appears  in  the  Lax  pair  of  the  Harry-Dym  equation  [24]  and  other 
related  equations  [2,  25],  the  oscillating  two-stream  instability  (OTSI)  problem 
[26,  27],  as  well  as  in  the  related  degenerate  two-photon  propagation  (DTPP) 
system  [28-30].  The  latter  two  are  important  physical  integrable  systems, 
which  with  this  result,  can  now  be  re-attacked. 

In  this  paper,  we  define  a  set  of  scattering  coefficients,  and  also  present  the 
time  evolution  of  those  scattering  coefficients,  for  the  spectral  problem  (2a), 
when  the  potential  of  that  spectral  problem  changes  sign.  This  therefore  covers 
the  general  class  of  initial  value  problems  for  the  CH  equation,  as  well  as  the 
same  for  the  OTSI  and  the  DTPP  systems.  The  general  formulation  of  an 
Inverse  Scattering  Transform  (1ST)  for  the  CH  equation,  valid  for  general 
initial  data  on  the  infinite  interval,  provided  only  that  the  initial  data  for  m 
vanish  sufficiently  rapid  at  infinity  (the  usual  Faddeev  conditions),  will  not  be 
addressed  in  this  publication.  Rather,  here  we  shall  only  address  the  direct 
scattering  problem  and  the  evolution  of  the  scattering  coefficients. 

Our  approach  will  be  as  follows.  Given  the  initial  data  for  m ,  we  break  the 
infinite  interval  into  intervals  of  positive  and  negative  values  of  the  initial 
data  for  m  +  k.  This  is  necessary  if  one  is  to  have  uniform  analytical  and 
asymptotical  properties  for  the  Jost  functions,  inside  any  interval.  In  the 
absence  of  such  uniform  analytical  properties,  the  usual  method  of  solution 
for  the  inverse  scattering  problem  appears  intractable.  Inside  each  of  these 
intervals,  we  can  solve  the  direct  and  inverse  scattering  problems,  independent 
of  the  other  intervals,  and  define  appropriate  scattering  data  for  the  initial 
data  inside  that  interval  [29].  The  real  problem  now  becomes  how  to  evolve 
these  independent  sets  of  scattering  data  in  time.  This  problem  can  indeed  be 
solved.  By  the  use  of  the  second  component  of  the  Lax  pair,  (2b),  one  can 
determine  the  evolution  in  time  of  the  scattering  coefficients  of  each  interval. 
(The  scattering  coefficients  are  the  coefficients,  a,a,b ,  and  b  to  be  introduced 
below.  These  coefficients  are  not  the  “scattering  data.”  But  the  scattering  data 
are  a  reduced  set  which  can  be  obtained  from  the  scattering  coefficients.) 
How  to  correctly  evolve  and  couple  the  /-evolutions  of  the  various  scattering 
coefficients  will  be  shown  below.  Following  upon  this  procedure,  one  could 
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then  formulate  and  solve  the  inverse  scattering  problem  inside  each  interval  at 
any  later  time,  and  each  could  be  solved  independent  of  the  other  intervals, 
with  an  independent  reconstruction  of  m  inside  each  interval.  The  union  of  the 
solutions  in  each  interval,  over  all  intervals,  would  give  the  total  solution  for  m 
on  the  infinite  interval. 

Of  course,  given  this  result,  one  could  construct  the  complete  scattering 
matrix  for  the  entire  infinite  interval,  as  a  product  of  the  individual  scattering 
matrices  from  each  interval.  However  how  useful  this  could  be  in  constructing 
an  inverse  scattering  solution  for  the  entire  interval  remains  to  be  seen.  Our 
approach  is  to  use  known  techniques  to  solve  the  full  problem.  Then  in 
hindsight,  perhaps  a  general  solution  for  this  inverse  scattering  problem  on  the 
full  infinite  interval  could  be  formulated. 

In  Section  2,  we  shall  describe  a  key  important  feature,  which  can  be  gleaned 
directly  from  the  CH  equation.  This  is  the  fact  that  the  zeros  of  m  +  k  will 
move  with  the  flow.  In  Section  3,  we  will  show  that  one  can  obtain  the  evolution 
of  the  flow,  w,  at  a  zero  of  m  +  k,  in  terms  of  two  functions  of  t.  This  will 
be  important  for  determining  the  time  evolution  of  the  individual  scattering 
coefficients.  In  Section  4,  we  will  describe  the  Jost  functions  and  the  solution  of 
the  direct  scattering  problem  for  all  intervals.  In  Section  5,  we  will  obtain  the 
evolution  of  the  scattering  coefficients  in  each  of  these  intervals.  In  Section  6, 
we  will  obtain  the  evolution  of  the  zeros  of  m  +  k,  from  simply  a  knowledge 
of  the  scattering  coefficients,  and  will  also  related  this  evolution  directly  to  m 
itself.  In  Section  7,  we  make  some  concluding  remarks  and  outline  future  work. 


2.  Background 

Let  us  next  expand  on  some  key  points  essential  to  the  development  of  a 
general  1ST  for  the  CH  equation.  First,  for  the  CH  equation,  zeros  of  m  +  k  will 
move  with  the  flow  [19].  This  follows  from  (la),  which  can  be  rewritten  as 

(9^  +  udx)(m  +  k)  +  2  (m  +  k)8xu  =  0.  (3) 

Note  that  whenever  m  +  k  =  0,  that  point  moves  with  the  flow.  As  a 
consequence  of  this,  Lagrangian  coordinates,  (/,  r),  can  be  defined  [19]  where, 

dxX  =  y/\m+K\,  T  —  t.  (4) 

These  coordinates  have  the  property  that  the  /  value  of  a  zero  of  m  +  k 
remains  invariant  in  r.  Thus  in  the  Lagrangian  frame,  assuming  only  simple 
zeros  of  m  +  k,  one  may  label  the  zeros  of  the  initial  data  of  m  +  k  with 
Xa  <  Xb  <  •  •  •  <  Xz  providing  that  all  zeros  are  simple  (i.e.,  3xm \x  ^  0). 
(Note  that  since  we  take  m  to  vanish  at  x  =  ±oo,  then  we  must  have  an  even 
number  of  simple  zeros.)  Furthermore,  the  /-separation  between  these  zeros  is 
maintained,  since  [*h  >J\m  +  k  |  tic  is  a  constant  of  the  motion,  where  xa  and  jq> 
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Figure  1 .  An  example  of  an  initial  profile  for  m( jc,  0)  wherein  m  +  k  has  two  zeros  at  xa  and  jc/,. 

are  the  Eulerian  values  of  any  two  successive  zeros.  Given  the  Lagrangian 
values  of  the  zeros  of  the  initial  data  for  m  +  k ,  it  follows  that  one  may 
transform  the  location  of  these  zeros  back  to  the  Eulerian  frame  [19].  Thus  the 
infinite  interval  in  either  frame  can  be  broken  up  into  a  set  of  successive 
intervals,  each  of  a  fixed  x-length>  and  consisting  of  alternating  intervals  of 
positive  and  negative  m  +  k.  Inside  each  of  these  intervals,  we  can  solve  the 
direct  scattering  problem  for  the  scattering  coefficients  for  that  interval. 

We  note  that  since  the  Lagrangian  variable,  x>  is  simply  a  coordinate 
transformation  on  jc,  one  can  actually  treat  this  problem  with  either  coordinate, 
depending  on  one’s  preference.  If  one  uses  Lagrangian  coordinate,  x,  then  the 
zeros  will  be  stationary.  However,  the  signs  of  the  coefficients  in  the  transformed 
Lax  pair,  (2),  will  vary  from  interval  to  interval,  due  to  the  magnitudes  in  (4). 
If  one  uses  the  Emlerian  coordinate,  jc,  then  the  zeros  will  move,  but  they  will 
never  cross.  However,  the  structure  of,  and  the  signs  in,  the  Lax  pair,  (2),  will  be 
uniform  throughout  all  intervals.  We  shall  choose  to  use  the  Eulerian  form.  In 
this  case,  the  zeros  of  m  +  k  will  be  designated  by  jca(/)  <  Xb(t)  <  •  •  •  <  jcz(/). 
Their  values  at  t  =  0  can  be  obtained  from  zeros  of  the  initial  data  of  m  +  k. 

To  illustrate  the  above,  consider  a  simple  nontrivial  example  where  m  +  k 
has  only  two  simple  zeros,  such  as  shown  in  Figure  1,  which  is  a  plot  of 
w(jc,  0)  vs.  jc.  Note  that  we  shall  always  consider  the  physical  case  where,  as 
x  — >  ±oo,  m(j c,  t)  vanishes.  Here  we  see  that  we  have  three  intervals;  — oo  < 
x  <  xa  where  m  +  k  >  0,  xa  <  x  <  Xb  where  m  +  k  <  0,  and  X&  <  x  <  oo 
where  again  m  +  k  >  0. 

3.  Evolution  of  the  flow  at  a  zero  of  m  +  n 

The  evolution  of  u  at  a  zero  of  m  +  k  follows  from  the  CH  equation.  We 
assume  that  h(jc,  t)  to  be  regular  at  a  zero  of  m  +  k,  and  therefore  expand  u  in 
a  Taylor  series  as 

1  . 

u(x.  t)  =  rfo  +  (x  -  x0 )t)i  +  —(x  -x0)  t] 2  H - 


(5) 
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where  xo(/)  is  the  location  of  the  zero  concerned,  as  a  function  of  /,  and  the  s 
are  functions  of  t  also.  Taking  this,  and  then  by  (lb),  we  have 

m{x,  t)  =  t] 0,2  +  m,)(x  -  x0)  +  :^2.4(*  -  *o)2  H -  (6) 

where  we  have  defined 

m.M  =  m-  W+ 2-  (7) 

Now  we  determine  the  evolution  of  these  coefficients.  First,  since 
m(xo(t ),  t)  +  k  =  0  in  order  to  be  a  zero,  it  follows  from  (6)  that  ^0.2  +  k  =  0,  or 

m  =  m  +  (8) 

The  time  evolution  of  the  remaining  coefficients  follow  from  (lb).  Inserting 
(5)  and  (6)  into  (lb)  then  gives 


0 

Zr 

11 

0 

K 

OD 

(9) 

+3f?|f?i.3  =  0 

(10a) 

9^2.4  +4?7l  772.4  +  5f?2f?l.3  =  0 

(10b) 

etc.  Clearly,  all  the  coefficients  in  (5)  can  be  determined  in  terms  of  two 
functions,  rj 0  and  r]  1,  and  the  initial  data  for  u. 


4.  Direct  scattering  problem 


In  this  section,  we  shall  set  up  the  direct  scattering  problem  and  define  the 
scattering  coefficients.  There  are  several  manners  in  which  this  could  be  done. 
On  the  finite  intervals,  one  could  set  up  the  Jost  functions  to  be  defined  as  in 
a  periodic  problem.  Or  one  could  set  them  up  with  independent  boundary 
conditions  like  (1,0)  and  (0,  1).  However,  what  we  have  found  to  be  best  is  to 
set  them  up  exactly  as  plane  waves,  like  one  would  do  for  the  infinite  interval. 
To  do  this,  we  take  the  boundary  conditions  on  each  Jost  function  to  be  exactly 
those  of  a  plane  wave  with  some  wave  vector  k,  when  m  =  0.  We  will  need  to 
relate  this  wave-vector  to  the  spectra  parameter  A,  and  to  do  this,  we  simply 
consider  the  limit  of  x  — ►  ±oc  where  m  vanishes.  Then  inserting  such  a  plane 
wave  into  (2a),  upon  setting  m  —  0,  we  have  that  k  and  X  will  be  related  by 


2k 

4  k 2  1  ’ 


(11) 


Here,  we  shall  treat  the  example  case,  such  as  shown  in  Figure  1,  where 
m  +  k  has  only  two  zeros  at  x  =  xt>  and  x  =  xc .  Extension  of  the  method  to 
more  general  cases  is  obvious.  Let  us  first  consider  the  left  interval  where  x  < 
x b.  At  the  left  end,  we  define  the  Jost  functions  (p\  and  (p\  by 
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0i(.t  — ►  — oo)  — ►  e  lla. 

(12) 

4>\(x  — ►  -oo)  — ►  e,kx , 

(13) 

where  the  subscripts  on  the  Jost  functions,  and  on  the  scattering  coefficients, 
will  refer  to  those  quantities  in  this  interval  (— oo  <  x  <  xb),  which  we  shall 
refer  to  as  “Interval  #1”. 

At  the  right  boundary  of  this  interval,  at  x  =  Xb,  we  define  two  other  Jost 
functions,  0i  and  0|,  such  that  they  appear,  at  the  boundary,  to  be  plane 
waves.  Thus  we  take 

01 (*b )  =  eikx> ,  dxMxb)  =  ikeila » ,  (14) 

\}r\{xb)  =  e~,kx\  Bx^r\{xb)  =  -ike~,kXh.  (15) 

With  these  definitions,  using  the  Green’s  function  for  (2a),  standard 
techniques  [18-20,  31]  and  the  Faddeev  conditions,  one  can  show  that  inside 
Interval  #1,  (— oo  <  *  <  x^) 

1.  <f> \elkx  and  \}/\e~lkx  are  analytic  in  the  upper  half  complex  /r-plane, 

2.  4>\e~lkx  and  \jf \elkx  are  analytic  in  the  lower  half  complex  £-plane. 

Furthermore,  each  set  of  Jost  functions  is  a  linearly  independent  set  on  the  real 
k  axis  (except  for  the  usual  singularity  at  k  =  0),  Thus  we  may  relate  the  two 
sets  as  follows,  which  defines  the  “scattering  coefficients”  on  this  interval. 


<t> i  =  a\(k,  t)xfr i  +  bi{k,  t)\Jr u 

(16) 

=  a\(k,  t)\J/ 1  +ft|(A:, 

(17) 

The  inverse  of  this  is 

0i  =  a\(k,  t)4> i  -b^k,  t)<p\. 

(18) 

0i  =  a\{k,  t)0i  -  b\ (k,  t)4>\. 

(19) 

From  the  Wronskian  of  (2a),  we  have  that,  for  real  k , 

c(|di  —  b\  b\  =  1 . 

(20) 

From  the  above  relations,  and  the  fact  that  the  interval  is  a  left  semi-infinite 
interval,  it  follows  that  [31] 

1.  Both  a\  and  b\  are  analytic  in  the  upper-half  complex  £-plane, 

2.  Both  5|  and  b\  are  analytic  in  the  lower-half  complex  £-plane. 

One  should  note  that  on  the  infinite  interval,  b\  and  b\  are  not  analytic  and 
usually  only  exist  on  the  real  k  axis  (and  with  the  usual  Mk  singularity  at  k  = 
0).  However  since  we  have  only  a  semi-infinite  interval  where  a\  and  b\  can  be 
defined  at  a  finite  value  of  x ,  and  since  4> \elkx  and  (dx<f>\)elkx  are  both  analytic 
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in  the  upper-half  complex  &-plane,  then  so  are  these  scattering  coefficients. 
Similarly  for  a\  and  *1  in  the  lower  half  plane. 

In  a  similar  manner,  we  can  define  the  scattering  coefficients  for  the  other 
intervals.  First,  we  define  our  intervals. 

Interval  #1:  —  oo  <  x  <  xt(t) 

Interval  #2:  Xb(t)  <  x  <  xc(/)  (21) 

Interval  #3:  *«-(/)  <  x  <  oo. 

Let  us  now  define  the  remainder  of  our  Jost  functions.  In  general,  we  will 
take  0(0)  t0  be  the  Jost  function  with  boundary  conditions  on  the  left  (right). 
We  shall  continue  to  use  subscripts  as  needed  to  label  the  Jost  functions  and 
scattering  coefficients  according  to  the  above  intervals.  Two  of  these  intervals 
are  semi-infinite.  We  have  already  defined  the  Jost  functions  for  Interval  #1. 
For  Interval  #3,  the  0  Jost  function  will  be  defined  by 

03  (x  — ►  +oo)  — ►  eikx> 

(22) 

l/r3(jc  — ►  +oo)  — >  e  . 

All  other  boundary  conditions  will  be  at  a  zero  of  m  +  k.  At  these  boundaries, 
we  take 


<h(Xb)  =  e-ikx\ 

dx<fc(xb)  = 

-ike~ikXh, 

<h(xb)  =  eikx\ 

3x02  (Xb)  = 

ikeikx\ 

(23) 

02  (*r)  =  eikXc, 

3*  faixc)  = 

ikeikx‘ , 

h(xc)  = 

dxfoiXc)  = 

£ 

T 

T 

(24) 

<Pi(xc)  =  e-ikx‘ , 

3a:  03  (xc)  = 

£ 

7 

:§j 

T 

(25) 

fa(xc)  =  eikx‘. 

3x03  (Xc)  = 

ike,kx‘ . 

Similar  to  the  above  statements  for  Interval  #1,  we  have  the  following  properties 
for  the  Jost  functions  inside  Interval  #2,  (x^  <  x  <  xc) 

1 .  <f>ietkx,  4>ie~lkx ♦  irie~ikx,  foe1**  are  all  analytic  in  the  entire  complex  /r-plane, 

For  the  Jost  functions  inside  Interval  #3,  (jtc  <  x  <  +oo),  we  have 

1.  (p^elkx  and  \j/^e~,kx  are  analytic  in  the  upper  half  complex  /t-plane, 

2.  4>ie~lkx  and  0 ^elkx  are  analytic  in  the  lower  half  complex  &-plane. 

The  scattering  coefficients  are  again  defined  as  in  Interval  #1.  For  the  middle 
interval,  Interval  #2,  we  take 


02  =  0202  +  *202 
02  =0202  +  *202, 


(26) 
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where 

•  02,  02,  ^2,  ^2  are  entire  functions  of  k . 
From  the  Wronskian, 

a2a2  —  bibi  —  1, 


we  obtain  the  inverse  relations 


02  =  0202  _  />202- 

02  =  ^202  —  ^202- 

On  Interval  #3,  we  take 

03  =  O303  -  ^303. 

03  =  O303  -  £>303  • 

where 

1.  «3  and  b^  are 

2.  53  and  b 3  are 

analytic  in  the  upper-half  complex  £-plane, 
analytic  in  the  lower-half  complex  ^-plane. 

From  the  Wronskian, 

^3^3  —  6363  =  1 , 


which  gives  the  inverse  of  (29)  to  be 

03  =  0303  +63  03, 
03  =  0303  +  ^3  03- 


(27) 

(28) 


(29) 


(30) 


(31) 


The  above  have  defined  the  scattering  coefficients  inside  each  interval.  From 
these  scattering  coefficients,  one  could  define  the  scattering  data  [31],  and  then 
inside  any  interval,  proceed  to  solve  the  appropriate  Marchenko  equations  for 
either  the  semi-infinite  interval  or  the  finite  interval  [29].  Thus  the  potentials 
could  be  reconstructed  from  only  the  scattering  data  in  that  interval,  and  this 
reconstruction  could  be  done  at  any  instant  of  time.  However,  it  is  also 
certainly  clear  that  these  scattering  coefficients  must  be  coupled  in  some  way, 
at  least  in  their  time  evolution.  Otherwise  the  separate  intervals  would  evolve 
independent  of  each  other,  in  violation  of  the  known  nature  of  characteristics 
of  PDFs.  Thus  it  is  essential  to  address  the  time  evolution  of  these  scattering 
coefficients,  which  we  do  in  the  next  section. 


5.  The  time  evolution  of  the  scattering  coefficients 

By  the  use  of  (2b),  one  can  obtain  the  time  evolution  of  the  scattering 
coefficients.  Starting  with  Interval  #1,  as  x  —00,  in  order  to  satisfy  (12)  and 
(13),  we  must  take 
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o?0  =  ikK,  a ^  =  —  ikK 


(32) 


where  a $  is  the  value  of  a  for  <p\  and  ot^  is  the  same  for  <j>\. 

At  the  right  end,  we  evaluate  (2b)  at  x  =  Xb(t)9  which  is  where  m  +  k  =  0, 
by  means  of  the  expansion  (5),  the  values  in  (14)  and  (15),  and  the  definitions 
of  the  scattering  coefficients  in  (15),  to  express  (p  in  terms  of  i/r,  xp  and  the 
scattering  coefficients.  One  obtains  the  following  evolutions  for  the  scattering 
coefficients  a\  and  b\. 


9,0|  =  /r?062^X  fl|  +  \  {lllb  +  ijcn0b)  b'e2'kXh' 


(33) 


1 


dtb'  =  i  ~  a'e~2ikn  +  kV~2K-'Jf-]b''  (34) 

where  r]ob  and  r)\b  are  the  values  of  rj o  and  rf\  at  the  left  zero,  Xb- 

The  evolutions  for  a\  and  b\  are  exactly  the  same  as  for  (33)  and  (34), 
except  that  one  replace  a\  and  b\  by  a\  and  b i,  and  /  by  —  /.  Thus 


k  1  (  i  \  - 

dta\  =  -/r/w,— ai  +  -  I  r)]b  -  —  m  I  b 

8'{>  =  K1”1  +  5”“)  'a'*“ku  "  Tk  [ 


-2  ikxb 


X  —  2k 


ridbK  I  r 

—J*1- 


(35) 


(36) 


In  exactly  the  same  manner,  one  can  find  the  evolutions  of  the  other 
scattering  coefficients.  Letting  r] qc  and  r]\c  be  the  values  of  rjo  and  77 1  at  the 
right  zero,  jcc,  we  then  find  that 

a  ./  ,  k  2ri\ck  +  irjoc  2ikx  2rj]bk  -  irjob  _2/fct„T 

9,a2  -  i(noc  -  m)^rrai  + - 77 - eT^b 2 - - - e  nkXhb2, 

2 kX  4 k  4k 


(37) 


9,62  =  ^  (>?1  c  -  ^ri0c^a2e  2ikx‘  -  X-  (r?i6  -  ^bob)  a2e  2ikXh 


+ 


i  r.  ^o*  +  %c"i, 

_  ^  _  2^  _  ^ - - - j  *2. 


(38) 


a-  •/  ,  «  -  2r]\hk  +  ir]0b  2ibc,,  ,  2  mck  -  it]oc  -2^7 

9,02  =  ~i(noc  ~  WvOtt—  02 - — - + - — - e  UkXcb2, 

2kX  4k  4k 

(39) 

9,62  =  ~  ( ??I6  +  Jj^nob)  a2e1,kXh  +  +  ^no a2e2ikXc 

‘  [.  -  mb  +  noch 

-Tk[k-2K-K^r~  p-  (40) 
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and 

a,a3  =  j  yhc  -  he~1,kx\  (41) 

3A  =  (».<  -  he-211*-  +  4  [A  “  2"  “  T1] bl-  <42) 

d,a,  =  ir10c^ai  -  ^  ^|C  +  ^??oc^  hellkXc,  (43) 

a,*, = ~  (i.< + «»*“'  -  j*  [A  -  2k  -  ir] fj-  (44) 

In  the  above,  the  evolution  of  the  four  s  are  still  unstated,  and  how  this 
needs  to  be  stated  will  be  described  in  the  next  section. 


6.  The  time  evolutions  of  the  r;’s 


At  this  point,  the  only  thing  remaining  before  one  can  determine  the  evolution 
of  the  scattering  coefficients  are  the  four  functions  r]ob ,  rj\b<>  and  t]\c. 
These  quantities  are  nothing  more  than  the  values  of  u  and  ux  at  the  zeros  of 
m  +  k.  Normally,  one  could  not  determine  these  until  after  the  solution  for 
u( jc,  t)  itself  was  determined.  However,  we  shall  now  find  that  these  functions 
can  also  be  determined  by  the  evolutions  of  the  scattering  coefficients  as 
given  by  (33)-(44),  combined  with  the  analytical  properties  of  the  scattering 
coefficients,  as  detailed  earlier.  To  see  this,  let  us  first  obtain  the  structure  of 
the  scattering  coefficients  about  the  singular  points. 

Consider  (11)  when  k  =  ±//2.  At  these  points  in  the  complex  plane,  1/A.  = 
0.  Then  according  to  (2a),  the  potential  term  vanishes,  and  the  solution  for  all 
Jost  functions  becomes  trivial.  One  then  can  ascertain  that  in  the  upper-half 
complex  £-plane,  at  k  =  ±//2,  we  have  a  =  1  +  0( A.-1)  and  b  =  (9(A._1)  in  all 
three  intervals,  provided  that  the  appropriate  a  and  b  are  known  to  exist  in  that 
half-plane.  In  particular,  let  us  designate  these  limits  as  follows.  First  for  the 
limit  of  k  — >  +// 2,  let  us  take 


a  i 


02 


02 


03 


-  H7*' + 

+  ~2X^bc  +  A),  t>2 

“4?  +  0(x~2) .  h 


(45) 

^Cc"  +  0(X~2), 

(46) 

~l£"  +  0(X-2), 

(47) 

(48) 
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while  in  the  lower  half-plane,  in  the  limit  of  k  — ►  — // 2,  we  will  take 


a  i 


a2 


a2 


b\  ^ 

(49) 

+  ^C  +  o(k~2), 

b2  — > 

(50) 

b2  — > 

-i1*1' +ou~2). 

(51) 

b}  — > 

-^'ri’  +  o(A-2). 

(52) 

where  all  the  /’s  are  defined  by  these  limits.  Thus  these  /’s  are  seen  to  be 
obtainable  from  the  scattering  data. 

On  the  other  hand,  we  may  return  to  (2a),  construct  the  appropriate  Green’s 
function,  and  obtain  a  solution  for  the  Jost  functions,  and  thereby  the  scattering 
coefficients,  about  the  singular  points  k  =  ±//2,  in  an  asymptotic  series  of  1/A. 
Doing  so,  one  finds  that  these  /’s  can  also  be  related  to  the  dynamical  variable 
m  by 


rX} 

I  y=  /  m(x,t)eyxdx,  (53) 

JXi 

where  y  =  ±  1  or  0,  xa  =  — oo,  xj  =  +oo,  and  as  before,  xt>  and  xc  are  the  two 
zeros  of  m  +  k.  So,  the  7’s  may  be  interpreted  either  as  scattering  data,  or 
integrals  of  m,  as  in  (53). 

Now,  let  us  put  our  attention  on  Interval  #1.  There  we  have  that  both  a\  and 
b\  are  analytic  in  the  upper-half  complex  A-plane.  Therefore  they  must  possess 
a  Taylor  series  expansion  about  any  point  in  this  half  plane.  (Note  that  we  are 
assuming  that  the  Faddeev  conditions  will  continue  to  be  satisfied,  at  least  for 
some  finite  time  during  the  evolution,  and  that  the  solution  will  not  break  at 
t  =  0.)  Inserting  such  an  expansion  into  (33)  and  (34),  one  finds  that  the 
evolution  of  b\  will  in  general  develop  a  nonzero  value  at  the  singular  point 
k  =  H2.  This  is  in  violation  of  the  above  expansion  for  b\  and  will 
immediately  lead  to  the  instantaneous  creation  of  an  essential  singularity  at 
the  singular  point,  in  violation  of  the  stated  analytical  properties  of  b\.  This 
can  be  avoided  only  if,  in  the  limit  of  1/A  — ►  0,  the  right-hand  side  of  (34) 
vanishes.  Using  the  above  expansions,  one  can  show  that  this  coefficient  must 
be 


eXh{rjib  -  r]0b)  +  rah  =  0. 


(54) 


If  one  would  carry  out  the  same  for  the  singularity  in  the  lower-half  plane,  one 
would  find  that  the  exact  same  condition  also  arises. 

The  same  can  also  be  done  in  the  other  intervals,  noticing  that  in  the  second 
interval,  all  coefficients  must  be  entire,  and  thus  b2  must  not  be  allowed  to 
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develop  essential  singularities  are  either  of  these  points,  k  =  it 2  and  k  =  —i/2. 
This  then  gives  the  conditions 

(n\c  -  n<kVc  -  (mb  -  mb)eXh  +  I(£l)  =  0,  (55) 

(me  +  Woc)e~Xc  -  (r]ib  +  no, b)e~Xh  +  / =  0,  (56) 

while  in  the  third  interval,  we  find 

e*'(me  +  floe)  -  I{cdl)  =  0.  (57) 

Solving  these  four  conditions  for  the  r]r s,  we  obtain 

nob=l~e-^I{a+bl)+l-e^I(-l).  (58) 

=  +  (59) 

»o c  =  \e-*‘l£"+X-e*'I(-x\  (60) 

me  =  -\e-x'I%"+X-<* I(c-dU.  (61) 


Thus  by  obtaining  the  Fs  from  the  limits  of  the  scattering  coefficients  about 
the  singular  points,  we  have  the  r/’s.  Once  we  have  the  t)' s,  we  then  have  the 
evolution  of  the  scattering  coefficients,  entirely  in  terms  of  the  same  scattering 
coefficients  and  their  initial  values.  Note  that  by  (9),  the  evolution  of  the  x/s 
are  given  by  (58)  and  (60),  which  therefore  can  also  be  obtained  from  the 
scattering  coefficients. 

To  make  this  point  further,  consider  solving  an  arbitrary  initial  value  problem 
by  means  of  this  composite  1ST  Taking  the  initial  data,  we  solve  (2a)  for 
the  initial  scattering  coefficients.  Then  we  integrate  the  scattering  coefficients 
forward  in  time,  using  (33}-{44),  taking  care  that  the  rj's  in  the  coefficients  of 
these  equations  always  satisfy  (58) — (6 1 ),  where  the  Fs  are  obtained  from  the 
limits  in  (45)-(48)  or  (49)-{52).  Then  clearly,  one  can  obtain  the  scattering 
coefficients  at  any  later  time,  unless  the  solution  for  u  happens  to  break.  We 
then  obtain  the  evolution  of  the  scattering  coefficients. 

We  note  in  passing  that  due  to  (53),  which  is  obtained  from  (2a),  the  7’s, 
for  y  =  are  exactly  integrable,  because  me±x  =  dx[e±x(±u  —  ux)],  due 
to  (lb).  Using  this  relation  and  (5),  one  may  readily  verify  that  (54)-(57) 
are  indeed  correct,  which  were  obtained  instead  by  requiring  the  scattering 
coefficients  to  retain  their  appropriate  analytical  properties  as  they  evolved. 
This  is  not  surprising,  since  by  requiring  the  scattering  coefficients  to  retain 
their  analytical  properties  during  their  evolution,  we  are  also  requiring  the 
potential  in  (lb)  to  continue  to  remain  as  it  should  be. 
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7.  Conclusions 

Let  us  now  review  and  summarize  the  situation.  As  demonstrated  above,  inside 
each  interval,  one  can  obtain  a  set  of  first-order  differential  equations  for  the 
evolution  of  the  scattering  coefficients,  as  in  (33)  and  (34).  These  equations 
will  be  dependent  on  the  position  of  the  zeros,  the  xn  s,  and  the  first  two  rj ’s  at 
each  zero.  One  then  finds  that  on  each  semi-infinite  interval,  there  will  be 
exactly  one  relation,  as  in  (54)  and  (57).  On  every  internal  finite  interval,  one 
will  obtain  two  other  such  independent  relations,  as  in  (33)  and  (34).  A  simple 
counting  then  shows  that  one  has  sufficient  data  to  determine  all  functions,  as 
well  as  the  evolution  of  the  scattering  coefficients. 

Another  feature  of  these  evolution  equations  for  the  scattering  coefficients 
that  should  be  noted  is  that  they  are  linear  in  the  scattering  coefficients,  provided 
one  is  not  in  a  neighborhood  of  either  of  the  singular  points  k  =  ±//2.  However 
in  a  neighborhood  of  these  singular  points,  the  equations  actually  become 
nonlinear,  since  the  values  of  the  rj’s  in  the  coefficients  of  these  equations  are 
to  be  given  in  terms  of  the  limits  of  the  ft’s,  as  given  in  (45)-(48)  or  (49)-(52). 

Although  we  do  now  have  the  equations  for  these  evolutions,  how  practical 
they  will  be  as  a  means  for  solving  the  time  evolution  of  the  scattering  data,  as 
well  the  same  of  the  CH  equation,  remains  to  be  seen.  However,  the  above  does 
demonstrate  that  in  principle,  it  is  workable.  The  least  that  one  can  say,  is  that 
when  this  method  of  solution  is  finally  and  fully  detailed,  we  will  obtain  a  new 
understanding  of  the  CH  equation,  as  well  as  a  new  and  deeper  understanding 
of  the  method  of  the  1ST.  Perhaps  we  will  also  be  able  to  better  understand 
why  the  CH  equation  satisfies  only  the  weak  Painleve  property  [12]. 

In  concluding  this  section,  we  should  make  some  comments  on  some  possible 
complications  that  will  appear  in  deriving  the  inverse  scattering  equations  for 
this  general  initial  value  problem  of  the  CH  equation.  For  example,  in  any 
neighborhood  of  the  zeros  of  m  +  /c,  the  asymptotics  of  the  Jost  functions  are 
not  as  clean  and  simple  as  for  the  regular  Schrodinger  equation,  where  one  has 
the  simple  result  that  <f>elkx  — >  1  uniformly  in  x  as  \k\  -»  oo  in  the  upper  half 
ft-plane.  Rather,  considering  (2a)  when  one  is  in  a  neighborhood  of  a  zero  of 
m  +  k,  for  \k\  large,  one  sees  that  any  Jost  function  will  have  a  boundary  layer 
of  the  structure  of  an  Airy  function.  At  any  x  away  from  such  a  zero,  one  does 
have  4>elkx  approaching  a  constant  limit  [17]  as  |£|  — ►  oo  for  fixed  jc.  However, 
if  one  is  exactly  at  such  a  zero,  then  one  is  always  inside  the  boundary  layer 
Thus  the  limit  is  not  uniform  in  x  for  the  closed  interval.  We  note  that  this 
may  be  an  important  consideration  in  deriving  the  Marchenko  equations  for 
this  system,  as  well  as  what  has  been  called  the  “linear  dispersion  relations” 
[31],  which  is  the  same  as  the  solution  of  the  associated  Riemann-Hilbert 
problem.  The  key  point  here  is  that  the  scattering  coefficients  are  defined  by 
the  values  of  the  Jost  functions  exactly  at  these  zeros,  which  is  always  inside 
these  boundary  layers. 
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Theoretical  studies  of  the  operation  of  crossed-field  electron  vacuum  devices  such  as  magnetrons 
and  crossed-field  amplifiers  (CFA)  have  usually  centered  on  their  initial  growth,  taking  this  as  an 
indication  of  their  operating  modes,  In  such  an  analysis  one  solves  the  equations  for  the  density 
profile,  the  operating  frequency,  the  growth  rate,  and  other  features  of  these  devices.  What  one  really 
obtains  then  are  only  the  conditions  for  the  device  to  turn  on.  The  dominant  interaction  in  this  stage 
is  a  Rayleigh-type  instability  which  initiates  a  quasilinear  diffusion  process  whereby  the  electron 
density  profile  redistributes  itself  into  a  profile  which  will  be  in  equilibrium  with  the 
ponderomotive-like  forces  produced  by  the  growing  rf  fields.  Eventually  the  rf  fields  will  saturate 
and  an  operating  device  will  settle  into  a  stationary  operating  regime.  This  stage  of  a  device's 
operation  is  called  the  “saturation  stage.”  This  latter  stage  involves  a  different  set  of  physical 
interactions  from  the  initiation  stage.  No  longer  is  there  a  growth  rate;  rather  the  rf  amplitudes  have 
saturated  and  as  a  result,  the  ponderomotive-like  forces  have  also  vanished  along  with  the 
quasilinear  diffusion.  In  this  saturation  stage,  we  find  that  new  rf  modes  appear.  In  fact,  there  are  a 
total  of  five  rf  modes,  two  of  which  are  the  usual  slow  modes  of  the  initiation  stage,  and  three  of 
which  have  fast  oscillations  in  the  vertical  direction.  One  fast  mode  corresponds  to  a  drift  plasma 
oscillation  while  the  other  two  fast  modes  are  drift  cyclotron  modes.  In  this  paper,  we  will  describe 
how  the  drift  plasma  oscillation  interacts  and  couples  with  the  slow  rf  modes  at  the  diocotron 
resonance.  ©  2006  American  Institute  of  Physics.  [DOI:  10.1063/1.2199227] 


I.  INTRODUCTION 

Crossed-field  devices  such  as  magnetrons  and  crossed- 
field  amplifiers  (CFA)  are  important  generators  of  high 
power  microwaves.  They  are  also  relatively  simple  in  their 
structure  and  operation.  They  consist  of  a  vacuum  region 
inside  of  which  a  high  density  electron  plasma  is  created 
The  theory  of  their  general  operating  range  follows  straight¬ 
forwardly  from  electrostatics  and  magnetostatics.1  However 
developing  a  theory  for  the  rf  oscillations  has  not  been  so 
direct.  The  genesis  of  the  rf  theory  is  the  Buneman,  Levy, 
Linson  theory  of  1 966. '  This  theory  is  based  on  a  linear 
instability  which  at  best,  for  the  operating  mode,  lies  deep 
inside  the  electron  sheath  and  does  not  correspond  to  the 
parameter  regime  used  by  the  design  engineers.  The  pre¬ 
dicted  instability  is  indeed  there/  4  however  it  seems  to  have 
nothing  to  do  with  the  initiation  of  power  generation  in  a 
CFA  or  magnetron.  The  dominant  effect  of  the  instability 
appears  to  be  to  simply  diffuse  the  edge  of  the  sheath  with  no 
growth  in  the  generation  of  power.  We  now  know  that  this 
instability  is  related  to  the  “magnetron  resonances,”  which 
occur  when  the  local  rf  frequency,  as  seen  by  the  electrons, 
a),(=a>- kv0),  is  equal  in  magnitude  to  the  cyclotron  fre¬ 
quency,  O.  (Here  co  is  the  rf  frequency,  k  is  the  rf  wave 
vector,  and  u0  is  the  EXB  drift  velocity.) 

It  is  also  now  well  known  that  a  linear  theory  is  insuffi¬ 
cient  to  describe  magnetrons  and  CFAs,  and  it  is  necessary  to 
include  ponderomotive  effects  in  order  to  understand  an  op¬ 
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erating  device.  Davidson  was  the  first  to  point  out  the  impor¬ 
tance  of  the  diocotron  resonance,  which  occurs  whenever 
0)^=0,  in  the  operation  of  this  device.  He  showed  that  this 
resonance  leads  to  the  reshaping  of  any  positive  density  gra¬ 
dients  into  negative  density  gradients/  Also  he  had  early  on 
recognized  that  the  quasilinear  theory  would  be  an  important 
component  for  any  understanding  of  magnetron  operation. 
Following  this,  other  works  have  also  pursued  the  study  of 
nonlinear  interactions  in  magnetron  operation,  with  the  use 
of  multiple  scales.6  Q 

Let  us  note  that  typically  such  devices  are  initiated  by 
first  applying  the  ambient  magnetic  field  (and  for  a  CFA,  also 
applying  the  rf  input  signal),  and  then  secondly  ramping  up 
the  dc  voltage.  Under  these  circumstances  one  finds  that 
within  a  couple  of  cyclotron  oscillations  a  near  classical  Bril- 
louin  sheath  will  form  next  to  the  cathode.4,l(V  13  Meanwhile, 
as  the  dc  voltage  rises,  this  near-Brillouin  sheath  slowly  ex¬ 
pands  its  thickness,  typically  taking  hundreds  of  cyclotron 
oscillations  for  the  dc  voltage  to  ramp  up  to  its  full  value.  As 
the  sheath  expands,  so  does  the  value  of  the  EX B  drift  ve¬ 
locity,  u0,  at  the  edge  of  the  sheath.  If  the  operating  param¬ 
eters  are  taken  to  be  inside  the  device’s  operating  range  then 
at  some  time  shortly  before  the  voltage  reaches  its  full  value, 
the  electron  drift  velocity  will  equal  the  phase  velocity  of 
some  unstable  rf  mode  in  the  slow-wave  structure.  At  this 
point  the  electrons  at  the  edge  of  the  sheath  will  be  strongly 
accelerated  by  the  unstable  rf  mode,  initiating  it  as  a  linear 
instability  on  a  shear  flow14  described  by  a  Rayleigh-type 
equation  for  a  shear  flow.^  Once  this  linear  instability  ini¬ 
tializes,  the  electrons  at  the  edge  of  the  sheath  will  undergo 
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the  Davidson  “quasilinear”  diffusion/*  The  growth  rate  of 
this  nonlinear  instability  is  found  to  be  directly  proportional 
to  the  density  gradient  at  the  edge  of  the  sheath/'14  which  for 
the  near-Bril louin  flow  is  very  large.  The  second-order 
theory  then  predicts  that  the  dc  electron  density  profile  will 
change  and  evolve.  As  it  evolves,  the  density  profile  relaxes 
to  a  new  density  profile,  one  which  is  in  equilibrium  with  the 
propagating  and  growing  rf  wave  in  the  slow- wave  structure 
and  one  which  also  extends  all  the  way  up  to  the  anode.14 
This  instability  of  a  near-Brillouin  sheath  is  not  surprising 
since  earlier  kinetic  studies  in  magnetically  insulated 
diodes16'1  had  revealed  that  stochastic  electron  motion  oc¬ 
curs  as  the  Brillouin  limit  is  approached.  Thus  electrons  in  a 
Brillouin  flow  can  be  easily  perturbed  and  kicked  out  of  the 
flow  and  into  new  states. 

Knowing  the  ordering  in  the  physical  processes  involved 
and  due  to  the  time  scales  involved,  one  can  use  an  adiabatic 
approximation  to  construct  an  analytical  qualitative  descrip¬ 
tion  of  the  final  reshaped  density  profiles  in  the  presence  of  a 
slow- wave  structure  by  means  of  the  WKB  approximation.1* 
To  go  beyond  the  WKB  approximation  one  has  to  turn  to 
numerical  solutions5’6  which  can  be  done  in  a  self-consistent 
manner,  producing  reshaped  density  profiles  which  were  in 
equilibrium  with  a  rf  wave  propagating  in  the  slow-wave 
structure.19  20  These  solutions  were  also  shown  to  agree  quite 
well  with  the  operating  range  of  the  device  #T266,  a  CFA 
manufactured  by  CPI  (Communications  and  Power  Indus¬ 
tries,  Inc.,  formally  Varian  Beverly  Microwave  Division)  op¬ 
erating  in  the  GHz  range.  A  review  and  summary  of  this 
entire  theory  has  been  given 

The  above  analysis  was  for  the  nonrelativistic  cold-fluid 
equations  in  the  electrostatic  approximation.  The  key  ap¬ 
proximation  in  it  was  to  assume  only  three  modes;  the  back¬ 
ground  dc  fields,  only  one  frequency  for  the  oscillating  rf 
fields,  and  second-order  dc  ponderomotive  terms  generated 
by  the  oscillating  rf  fields.  Many  of  the  results  of  this  analy¬ 
sis  have  since  been  corroborated  by,  or  have  corroborated, 
results  from  other  approaches  such  as  particle-in-cell  (PIC) 
code  results  or  results  from  a  guiding  center  (GC)  approach. 
We  note  that  in  comparing  these  other  approaches  with  the 
cold-fluid  approach,  one  must  keep  in  mind  that  the  resolu¬ 
tion  afforded  by  PIC  codes  is  limited  due  to  the  relatively 
large  grid  size  required  as  well  as  the  relative  low  number  of 
particles  that  can  be  handled.  The  approximations  used  in  the 
guiding  center  approach  are  most  closely  satisfied  for  low 
density  electron  plasmas  where  the  plasma  frequency  is 
much  smaller  than  the  electron  cyclotron  frequency,  fl.21 
With  regard  to  our  cold-fluid  approach,  it  needs  to  be  noted 
that  these  solutions  are  obtained  in  the  adiabatic  approxima¬ 
tion,  are  therefore  equilibrium  solutions  and  are  valid  in  the 
limit  of  slowly  growing  rf  fields.  Also  we  note  that  Lau  and 
Chemin  2  has  pointed  out  that  eigenfunction  expansions  such 
as  those  which  have  been  used  in  our  cold-fluid  approach 
have  definite  advantages  over  other  methods. 

Summarizing  some  of  the  results  of  other  approaches, 
the  second-order  vertical  drift6'9  has  also  been  found  in  the 
GC  approximation  by  Riyopoulos.r  This  is  a  very  important 
feature  about  which  we  will  have  more  to  say  later.  This  drift 
has  also  been  observed  in  PIC  code  calculations4  as  well. 


Riyopoulos  has  also  used  the  GC  approximation  to  study  the 
initiation  of  the  instability  of  a  classical  Brillouin  sheath  24 
which  was  then  confirmed  also  in  the  cold-fluid  approach.14 
He  has  also  showed  that  in  order  to  fit  experimental  data,  it  is 
necessary  for  the  electron  density  at  the  diocotron  resonance 
to  be  taken  on  the  order  of  one-half  of  the  maximum  Bril¬ 
louin  value*  s  with  the  density  in  the  spokes  being  on  the 
same  order  of  magnitude.  This  is  indeed  what  happens  in  the 
self-consistent  cold-fluid  theory 1H  20  as  one  can  also  see  from 
figures  given  below.  Along  with  this,  it  is  to  be  noted  that 
many  PIC  code  results4’26  now  show  hub  and  spoke  features 
similar  to  that  obtained  by  our  self-consistent  cold-fluid 
treatment.  *  We  also  point  out  that  the  ultralow  noise 
feature  of  the  CPI  #T266  (Ref.  20)  has  also  been  found  in 
other  crossed-field  devices.  We  note  that  all  these  mecha¬ 
nisms  used  to  create  ultralow  noise  devices  seem  to  have  the 
consequence  that  the  normally  (assumed)  laminar  electron 
flow  just  above  the  cathode  has  been  disturbed  somehow  into 
a  more  turbulent  flow.  Lastly,  we  note  that  except  for  the 
experimental  knowledge  of  the  operating  regimes  of  magne¬ 
trons  and  CFAs,  there  has  been  no  other  corroborating  evi¬ 
dence  concerning  an  explanation  as  to  why  the  operating 
voltage  regime  of  these  devices  is  experimentally  limited  to 
less  than  20%  or  so  above  the  Hartree  value  v  and  certainly 
never  ever  really  approaching  the  higher  Hull  cutoff  voltage. 

However,  the  above  theory  applies  only  to  what  we  shall 
call  the  “initiation  stage”  which  is  where  the  device  is  ini¬ 
tializing  and  where  the  internal  rf  wave  is  still  growing.  The 
“saturation  stage”  follows  the  initiation  stage  and  is  the  stage 
where  the  rf  wave  will  have  reached  its  maximum  amplitude, 
have  saturated  and  the  device  will  be  steadily  delivering  rf 
power.  These  two  stages  differ  significantly  in  two  distinctly 
different  features.  First,  by  the  end  of  the  initiation  stage  a 
second-order  dc  flow  of  electrons  from  the  cathode  to  the 
anode  will  have  come  into  existence.  The  second  feature  is 
that  when  one  arrives  at  the  saturation  stage,  the  growth  rate 
of  the  rf  fields  will  have  vanished.2""  [Note  that  consistent 
with  this,  in  the  initiation  stage  one  does  see  the  growth  rate 
smoothly  decreasing  toward  zero  as  the  rf  amplitude  rises,  as 
shown  in  Figs.  4(b)  and  6(b)  of  Ref.  20.]  These  two  features 
have  significant  physical  and  mathematical  consequences. 
Physically,  the  quasilinear  diffusion  ceases  in  the  saturation 
stage  and  the  additional  dc  current  between  the  cathode  and 
the  anode  can  no  longer  be  neglected.  Mathematically,  one 
finds  three  new  rf  modes  appearing  and  three  narrow  reso¬ 
nance  layers  located  at  the  positions  of  the  magnetron  and 
diocotron  resonances/  ’  The  latter  are  sufficiently  narrow  that 
a  new  scaling  must  be  introduced  in  order  to  obtain  solutions 
for  the  rf  fields.  Furthermore  they  are  so  narrow  that  one 
would  not  expect  to  see  them  in  PIC  code  results.  This  is  not 
to  say  that  their  effects  are  not  contained  in  PIC  code  results, 
but  only  that  the  statistics  of  PIC  codes  do  not  have  the 
resolution  to  see  such  narrow  resonances. 

In  this  paper,  we  will  study  the  structure  of  the  rf  solu¬ 
tions  about  the  diocotron  resonance  in  the  saturation  stage 
which  contains  the  major  mechanism  for  the  transfer  of  dc 
power  into  rf  power.  In  an  operating  device,  optimally  this 
resonance  will  occur  somewhere  along  the  edge  of  the  elec¬ 
tron  sheath  where  the  density  gradient  is  large  and  negative. 
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FIG.  1 .  The  geometry  and  configuration  of  a  planar  magnetron  or  CFA.  The 
magnetic  field  is  directed  into  the  paper  and  the  electron  drift  velocity  is  to 
the  right.  The  z  axis  is  perpendicular  to  the  paper,  the  x  axis  is  parallel  to  the 
cathode  surface,  and  the  y  axis  is  perpendicular  to  the  cathode. 


Density 
U 


FIG.  2.  (Color)  The  electron  density  for  one  period  of  a  fully  developed 
spoke  structure  in  the  initiation  stage.  The  anode  is  in  the  foreground  and  the 
cathode  toward  the  back  Taken  from  Fig.  5  of  Ref.  9. 


Using  two  multiple  scales,  we  will  obtain  the  inner  and  the 
outer  solutions  around  this  resonance  to  first  order  in  an  ex¬ 
pansion  parameter  (taken  to  be  the  square  root  of  the  vertical 
dc  velocity).  From  these  solutions  we  will  obtain  the  coeffi¬ 
cients  that  asymptotically  match  the  inner  and  outer  solu¬ 
tions,  thereby  obtaining  the  mode  conversion  coefficients. 

In  Sec.  II  we  will  discuss  the  geometry  used  and  give  the 
nonrelativistic  rf  equations  for  the  saturation  stage  in  planar 
geometry  and  the  electrostatic  approximation.  9  In  Sec.  Ill 
we  will  quickly  treat  and  dispense  with  the  cyclotron  modes 
since  they  will  be  noninteracting  at  the  dioeotron  resonance. 
In  Sec.  IV  we  will  obtain  the  “outer”  solutions  which  are 
valid  outside  of,  but  near  the  dioeotron  resonance.  In  Sec.  V 
we  shall  proceed  to  use  a  multiple  scale  expansion  to  solve 
for  the  “inner”  solutions  inside  the  dioeotron  resonance.  An 
expansion  will  be  found  for  the  inner  solutions  and  the  solu¬ 
tions  will  be  given  in  terms  of  Fresnel-type  integrals.  In  Sec. 
VI  we  shall  find  the  asymptotic  form  of  the  connection  co¬ 
efficients  that  connect  the  inner  and  outer  solutions.  We  shall 
then  use  these  coefficients  to  obtain  the  conversion  coeffi¬ 
cients  which  will  give  how  any  one  mode,  as  it  passes 
through  the  dioeotron  resonance,  converts  into  the  other 
modes  in  passing  to  the  other  side.  In  Sec,  VII  we  will  make 
some  concluding  remarks. 


slow  down  the  electromagnetic  rf  wave  and  (ii)  to  select  the 
effective  rf  (electrostatic)  wave  vector,  k,  the  plasma  will  see 
and  respond  to. 

During  the  initiation  stage  the  dioeotron  interaction 
causes  the  sheath  to  readjust  its  profile  from  a  pure  Brillouin 
flow  to  one  in  equilibrium  with  the  growing  rf  wave.  The 
wavelength  of  the  rf  wave  will  determine  the  periodicity  of 
the  spoke-like  structure  which  evolves  from  the  sheath  due  to 
the  nonlinear  diffusion  and  will  extend  from  the  surface  of 
the  sheath  out  to  the  anode.  A  typical  plot  of  this  structure  is 
seen  in  Fig.  2  which  is  a  plot  of  the  electron  density  vs  x  and 
y.  At  the  back  is  the  Brillouin-type  sheath  while  in  the  front 
one  can  see  the  spoke  of  charge  which  carries  the  v  compo¬ 
nent  of  the  dc  current  to  the  anode.  Note  that  the  structure 
shown  contains  only  two  Fourier  components  in  the  x  direc¬ 
tion:  a  dc  background  and  a  single  value  of  k  in  etkx  repre¬ 
senting  the  rf  component. ' 

The  basic  cold-fluid  equations  coupled  with  Poisson’s 
equation  that  we  will  use  are: 


fyi  +  V  ■  (vn)  =  0, 

(i) 

d(\  +  (v  •  V)v  -  V<£  +  vXH  =  0, 

(2) 

II.  BASIC  EQUATIONS  AND  WKB  SOLUTIONS 

The  geometry  and  configuration  that  we  shall  be  using  is 
shown  in  Fig.  1  which  is  a  diagram  of  a  “planar  magnetron.” 
At  the  bottom  is  the  cathode  which  is  an  electron  emitting 
material.  It  is  located  in  the  x z  plane  and  at  v=0.  We  take  the 
ambient  magnetic  field  to  be  directed  along  the  negative  z 
axis  so  that  the  electron  drift  velocity  will  be  mainly  in  the 
positive  Jt  direction.  (In  the  saturation  stage  this  dc  drift  cur¬ 
rent  will  have  a  small  vertical  component  in  the  y  direction, 
as  shown.)  The  electron  cyclotron  frequency  is  fl 
=eB0/(mc),  where  B0  is  the  ambient  magnetic  field.  Next  to 
the  cathode  is  the  electron  sheath  which  extends  out  for  some 
distance  from  the  cathode  as  shown.  The  anode  is  located  at 
a  distance  £  above  the  cathode.  On  the  anode  is  a  slow-wave 
structure  which  is  simply  a  wave  guide  along  which  the  rf 
wave  will  propagate.  This  slow-wave  structure  is  there  to  (i) 


V2<£  =  m.  (3) 

In  the  above  equations  the  fluid  velocity  is  in  the  xy  plane 
and  all  quantities  are  taken  to  be  independent  of  the  z  coor¬ 
dinate.  We  have  also  simplified  the  notation  by  taking  n  to  be 
the  electron  plasma  frequency  squared  \4jre2  p/m  where  e(m ) 
is  the  electronic  charge  (mass)  and  p  is  the  electron  number 
density]  and  4>  to  be  (e/m)  times  the  electrostatic  potential. 

For  the  saturation  stage  one  expands  these  equations  sig¬ 
nificantly  different  from  that  for  the  initiation  stage.  Here  we 
take 

n(;c,y,r)  =  n0  +  (n ^l**-**)  +  c  #c  .  ),  (4) 

and  similarly  for  the  other  quantities.  In  the  above  the  sub¬ 
script  “0”  will  refer  to  a  dc  quantity  while  the  subscript  “1” 
will  refer  to  a  rf  quantity.  Also,  k  is  the  wave  vector  for  the  rf 
wave  and  <u  is  the  rf  frequency,  both  of  which  are  real.  Note 
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that  we  do  not  include  any  “second-order  dc  terms/1  by 
which  this  expansion  differs  from  that  used  in  the  initiation 
stage.6  The  reason  for  this  change  in  the  form  of  the  expan¬ 
sion  for  the  saturation  stage  is  to  incorporate  the  dc  vertical 
velocity  into  lowest  order.  By  the  end  of  the  initiation  stage, 
second-order  corrections  would  typically  no  longer  be  small, 
and  therefore  in  the  saturation  stage  should  be  directly  com¬ 
bined  with  the  lower  order  dc  terms  into  a  general  dc  term. 
As  a  consequence  the  dc  equations  of  the  saturation  stage 
will  contain  quadratic  rf  terms  in  addition  to  the  usual  dc 
terms  of  the  initiation  stage.  Thus  unlike  the  initiation  stage, 
the  dc  drift  velocity  in  the  saturation  stage  will  have  both  an 
x  and  y  component,  as: 

vo  =  v()x(y)-i  +  V()\(y)y  •  (5) 


To  obtain  the  equations  for  the  saturation  stage  we  insert 
expansions  for  the  density,  velocity  and  potential,  as  in  (4), 
into  (l)-(3)  and  expand.  Note  that  in  the  saturations  stage  we 
will  not  use  a  small  amplitude  expansion;  instead  we  only 
separate  on  the  basis  of  Fourier  components.  First  we  have 
the  dc  components,  which  are: 


T  Own-  +  ntv 
dy 


i.v 


+  «>„)  =  0, 


(6) 


!><»/—  -m  + 


V()y 


A  dy 

dy  ov 
dy 

=  0, 


.  dvu 
dy 


„  d<t>u 
+  “  ~T  + 

dy 


dv  iv 
dy 


=  0, 


-  ikv  Xx  I  +  c  .  c  . 


(7) 


(8) 


d% „ 

dy2 


~n  o  =  0. 


(9) 


Note  the  quadratic  ponderomotive-like  rf  terms  present. 
Lastly,  from  the  components  of  the  rf  fundamental, 
of  the  same  equations  [which  are  also  the  linear  perturbations 
of  (l)-(3)  we  obtain: 


* 


dv()x  dvu 

v—  +  vo  y— 
dy  dy 


dy 


d  <£±  a 


2  - k2 4>\  —  «i  =0, 


dy 

where 

co,  =  co  -  kv  ()x. 


-  icoenx  =  0, 

(10) 

icoevXx-  ikcf) |=0, 

(in 

ll 

o 

(12) 

(13) 

(14) 

When  u0\  *s  zero’  Eqs-  (10)— ( 13)  become  the  same  rf  equa¬ 
tions  as  those  used  for  the  initiation  stage  6  upon  taking  co  to 
be  complex.  In  the  initiation  stage  the  vanishing  of  u0v 
causes  the  derivatives  of  nx,  vXx ,  and  ulv  to  vanish  from  these 


FIG.  3.  The  theoretical  background  densily  for  the  T266  from  the  initiaiion 
equalions.  The  units  of  /in  are  (rad/ns)2  and  of  y  are  ml  100.  The  cathode  is 
to  the  left  and  the  anode  is  located  at  y=0.381 


equations.  Thus  the  above  set  of  five  ODEs  of  the  saturation 
stage  can  be  reduced  to  the  second-order  set  of  ODEs  and 
three  algebraic  relations  of  the  initiation  stage. 

When  u0v  is  nonzero  as  in  the  saturation  stage,  one  then 
has  all  five  derivatives  of  the  rf  quantities  present  and  fur¬ 
thermore,  Eqs.  ( 1 0)— ( 1 3)  can  be  solved  for  each  derivative  of 
the  rf  quantities.  Doing  so,  one  obtains 


dn  | 
dy 


^  Ov 


("0Uiy-«iU0vH 


d4>\ 

-n0-~-  +n0vXx(il-ikv(h) 

dv 


dn{) 

~rvuvoy 


(15) 


dv \x 

dy 


Vo  v 


n- 


dv  o, 
dy 


(16) 


dv  iy 
dy 


^Ov 


dv0Y\  n  d<f> , 


j± i 

dy 2 


-k2<f>  j  +  n  j . 


(17) 

(18) 


This  is  a  fifth-order  set  of  nonsingular,  linear  ODEs  for  the  rf 
quantities  when  u0y  is  nonzero  and  all  the  dc  quantities  exist. 
By  nonsingular  we  mean  that  the  derivatives  of  the  rf  quan- 
tites  always  exist  and  are  finite.  (The  equations  are  linear  in 
the  rf  variables  and  in  the  saturation  stage  there  are  no  de¬ 
nominator  terms  which  could  vanish  and  give  rise  to  singu¬ 
larities.) 

The  above  rf  equations  of  the  saturation  stage  differ  from 
the  rf  equations  of  the  initiation  stage  in  two  aspects.  First, 
u0v  is  nonzero  in  the  saturation  stage  and  is  zero  in  the  ini¬ 
tiation  stage  and  second,  the  growth-rate  is  zero  (which 
means  that  co  is  real)  in  the  saturation  stage  and  is  nonzero 
(which  means  that  co  is  complex)  in  the  initiation  stage 

As  an  example  of  the  solution  in  the  initiation  stage,  is 
shown  in  Figs.  3-5  a  typical  numerical  solution  for  the  CPI 
device  #T266.  One  notes  in  Fig.  3  that  the  background  den¬ 
sity  consists  of  a  strong  Brillouin-type  hub  with  an  extended 
foot  on  which  the  rf  density  variations  would  oscillate.  Fig- 
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FIG.  4.  The  theoretical  parallel  drift  velocity  for  the  T266  from  the  initiation 
equations.  The  units  for  are  m/(I00  ns). 


ure  4  shows  the  parallel  de  drift  velocity  while  Fig.  5  shows 
the  vertical  dc  velocity.  One  notes  that  the  vertical  dc  veloc¬ 
ity  is  rather  small,  typical  about  one-thousandth  of  the  paral¬ 
lel  velocity.  (The  spike  in  the  vertical  dc  velocity  is  due  to 
the  drift-cyclotron  resonance  at  a>e=il  located  just  above  the 
cathode.)  Except  where  the  dnft-cyclotron  resonance  occurs, 
this  vertical  dc  current  is  essentially  constant.  Here  the  im¬ 
portant  point  is  that  v0y  is  significantly  smaller  than  the  other 
dc  variables.  [The  reader  should  note  that  at  the  present  time 
there  exist  no  self-consistent  solutions  of  the  saturation  stage 
(6)-(13).  That  will  be  the  object  of  future  work.  The  above 
plots  are  for  the  initiation  stage.  However  one  would  expect 
those  of  the  saturation  stage  to  be  similar  to  these.] 

Due  to  the  smallness  of  u0v,  it  becomes  difficult  and 
time-consuming  to  numerically  integrate  the  rf  equations  of 
the  saturation  stage.  One  notes  that  three  of  the  derivatives  in 
Eqs.  ( 1 5)— ( 1 8)  are  inversely  proportional  to  u{)v  or  u(2)y.  Thus 
it  follows  that  in  general  large  values  of  the  derivatives  can 
occur  and  do  occur,  as  can  be  seen  in  Figs.  6  and  7  which 
shows  a  typical  solution  that  occurs  when  one  integrates 
( 1 5)— ( 1 8)  using  the  initiation  stage  dc  values  with  sample 
initial  data.  As  one  can  see,  the  density  oscillations  dominate 
while  the  velocity  oscillations  are  an  order  of  magnitude 


FIG.  5.  The  theoretical  vertical  drift  velociiy  forihe  T266  from  lhe  initial  ion 
equations.  The  units  for  u0v  are  mf{  100  ns). 
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FIG.  6.  The  magnitude  of  the  rf  Ny  as  a  function  of  y  from  a  direct  numeri¬ 
cal  solution  of  Eqs.  (15M18)  for  the  background  given  in  Figs.  3-5. 


smaller.  (The  oscillations  in  the  potential  are  even  another 
order  smaller.) 

In  Ref.  30  the  WKB  solutions  of  the  above  equations, 
( 1 5) — ( 1 8),  were  presented.  Assuming  fast  phase  variations  of 
the  form 

nx  etc  . ,  (19) 

where  An  is  a  slowly  varying  amplitude  and  a  is  a  fast  WKB 
wave  vector,  it  was  found  that  a  could  take  on  five  possible 
values  of  which  two  were  zero  and  corresponded  to  the  usual 
(slow)  modes  of  the  initiation  stage.  The  other  three  modes 
were  fast  modes  and  their  eigenvalues,  a,  and  unnormalized 
eigenvectors,  [«,  ,u1v,D]V,<£i].  in  lowest  order  were  found  to 
be 

a,  =  (coe  +  \)/i>0v.;  [-  a2/f()/A:2,  -  ia{(il  -  ul)x)/k7, 

ax\/k 2,  wo/*2],  (20) 

a2  =  (°>e-  \)/u0v;  [-  a\n{)lk2%  -  ia2(il  -  u^x)ik2% 

-a2\/k2%  njk 2],  (21) 


FIG.  7.  The  magnitude  of  the  rf  vlx  as  a  function  of  y  from  a  direct  numeri¬ 
cal  solution  of  Eqs.  (15 >— ( 18)  for  the  background  given  in  Figs.  3-5.  The 
insert  shows  that  the  solution  is  actually  comprised  of  more  than  one  fast 
mode,  three  in  fact. 
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Eigenvalues  vs.  y 


FIG.  8.  (Color)  The  five  rf  modes  for  lhe  background  given  in  Figs.  3-5. 
Noie  lhat  the  three  fast  modes  never  cross  and  tend  to  have  a  fixed  separa¬ 
tion  except  near  the  resonance  between  the  lower  cyclotron  mode  and  the 
slow  modes  which  is  near  lhe  ealhode.  The  main  resonance  which  is  lhe 
dicotron  resonance,  occurs  al  the  edge  of  the  sheath  (y**0.25)  and  corre¬ 
sponds  to  the  crossing  of  the  slow  modes  (o,=0)  and  lhe  drift  mode  (a 
=  0),/vQy). 


a3  =  d)Jv0y\  [-  a\n(n  -  Vq x)/k2,  ia^(il  -  v[)x)lk2< 

-  iil/k,  H(0  -  VQX)/k2],  (22) 

where 

\  =  VO2  +  n0  -  (23) 

and  Vqx  is  the  derivative  of  with  respect  to  y.  [In  the  dc 
initiation  equations  v'0x  is  equal  exactly  to  rt0/Sl  in  which 
case  \  would  be  exactly  equal  to  H.  However  from  dc  satu¬ 
ration  Eq.  (7),  we  see  that  in  general  this  need  not  necessarily 
be  true  in  the  saturation  stage.]  A  plot  of  these  as  vs  y  for  the 
initiation  stage  background  is  given  in  Figs.  3-5  is  given  in 
Fig.  8.  Note  that  the  typical  values  for  a  in  a  fast  mode  are  in 
the  hundreds,  which  is  large,  whereas  the  typical  value  for 
the  dc  quantities  is  unity.  Thus  as  seen  from  the  eigenvectors 
in  (20)— (22)  the  density  oscillations  dominate  in  the  fast 
modes  and  the  potential  oscillations  are  the  weakest. 

From  the  above  eigenvalues,  eigenvectors  and  the  over¬ 
all  phase,  which  is 

ipj  =  Lx  -  cot  +  j  a jdy ,  (24) 

one  can  determine  the  nature  of  these  three  fast  modes.  Fol¬ 
lowing  a  fluid  particle  one  finds  that  in  the  first  two  modes 
the  fluid  particle  executes  circular  oscillations  at  the  modi¬ 
fied  cyclotron  frequency,  about  the  guiding  center.  These 
eigenmodes  are  thus  the  fluid  analogy  of  the  well-known 
“Slater  orbits”1  which  are  the  single-particle  cycloidal  orbits 
for  electrons  in  a  Brillouin  sheath.  In  the  Eulerian  frame  one 
sees  rapid  oscillations  in  y,  as  already  shown  in  Figs.  6  and 
7.  We  shall  refer  to  these  modes  as  the  “drift-cyclotron 
modes.” 

For  the  third  eigenmode  we  again  have  that  the  dominant 
component  of  the  eigenvector  is  the  density  oscillation  with 
the  motion  of  the  fluid  particle  being  mainly  in  the  parallel 


direction  and  with  the  two  velocity  components  in  phase. 
The  motion  is  thus  a  drifting  back-and-forth  motion  and  will 
therefore  be  referred  to  as  the  “drift  mode.” 

For  any  two  modes  when  the  as  are  well  separated,  the 
modes  are  noninteracting.  However  when  they  cross  they 
interact  and  energy  can  transfer  from  one  mode  to  another. 
For  the  two  slow  modes,  they  are  of  course  continually  in¬ 
teracting  and  satisfy  the  well-known  initiation  equations6  ex¬ 
cept  near  the  cyclotron  resonances  and  the  diocotron  reso¬ 
nance.  At  the  diocotron  resonance,  the  slow  modes  cross  the 
drift  mode,  interact  and  exchange  energy  with  it. 

For  an  operational  device  the  diocotron  mode  must  cross 
the  slow  modes  near  the  Brillouin  edge.  This  is  the  interac¬ 
tion  which  allows  the  device  to  generate  rf  power.  Thus  an 
understanding  of  the  interactions  at  the  crossing  of  the  drift 
mode  and  the  slow  modes  is  of  prime  importance.  Also  of 
importance  is  when  the  second  cyclotron  mode  crosses  the 
zero  modes.  This  occurs  in  the  example  shown  in  Figs.  3-5 
and  appears  to  be  responsible  for  the  ultralow  noise  opera¬ 
tion  of  this  device.20  No  known  examples  of  when  the  first 
cyclotron  mode  crosses  the  slow  modes  are  known  nor  is  it 
known  if  this  interaction  would  be  of  interest. 

To  treat  this  problem  it  is  necessary  to  use  multiscale 
methods  to  separate  the  fast  variations  from  the  slow  ones.  In 
the  next  few  sections  we  shall  use  a  multiscale  expansion  to 
obtain  the  rf  solutions  for  all  five  modes  in  the  region  of  the 
diocotron  resonance,  which  is  near  y,SS5 0.25  in  Fig.  8. 


111.  THE  DRIFT-CYCLOTRON  MODES  NEAR  THE 
DIOCOTRON  RESONANCE 


Let  us  first  handle  the  two  drift-cyclotron  modes  near  the 
diocotron  resonance.  We  introduce  the  following  notation  for 
our  rf  eigenvector: 

V=[n\,vu,V\y,<t>\,<t>{\.  (25) 


This  eigenvector  is  to  satisfy  the  fifth-order  set  of  ODEs 
given  by  ( 1 5)— ( 18).  From  the  WKB  analysis  given  above  we 
have  that  the  solution  for  the  drift-cyclotron  modes  near  the 
diocotron  resonance  will  be  of  the  form  given  in  (20)  and 
(21 ).  This  is: 


,  ri  "<-  +  X  ,  vl'Ov  n0Voy 

Vi-Jv\  n„— —  ,t(£l-v0J—,-v0v,-i—-, 


'‘oi’ov 

\  (  Ct)e  +  \  ) 


(26) 


where  the  subscript  1  indicates  that  this  is  the  eigenvector  for 
the  first  drift-cyclotron  mode.  This  expression  also  contains 
an  overall  amplitude  factor  which  is  found  from  the  first- 
order  correction  to  the  WKB  approximation.  This  amplitude 
factor,  suitably  normalized  to  be  unitless,  for  this  mode  is 


(27) 


The  eigenvector  for  the  second  drift-cyclotron  mode  will  be 
the  same  as  for  the  first  drift-cyclotron  mode  except  that  \  is 
to  be  replaced  by  -\  and  al  replaced  by  a2. 
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As  is  obvious  from  the  above,  for  d)e— ►O,  no  compo¬ 
nents  are  singular  and  ax  and  a2  are  well  separated  from  the 
other  three  modes.  Thus  the  WKB  solution  for  the  drift- 
cyclotron  modes  will  simply  continue  across  the  diocotron 
resonance  without  any  significant  mixing  or  conversion.  In 
the  next  section  we  treat  the  outer  solutions  for  the  remaining 
three  modes  and  following  that  we  will  obtain  the  inner  so¬ 
lutions. 


IV.  THE  OUTER  SOLUTIONS  NEAR  THE  DIOCOTRON 
RESONANCE 


The  outer  solutions  are  the  solutions  for  the  eigenvectors 
when  one  is  approaching  the  diocotron  resonance  from  out¬ 
side.  From  the  WKB  solution  for  the  drift  mode,  (22),  we 
have  the  scalings  for  the  various  components  in  the  eigen¬ 
vector.  Thus  this  mode  will  be  of  the  form 


^3o  = 


kvL 


p{n  ,p 2V0y,p3 “ , P4V0yil,P5 »<* 


,iP'a3dy 


(28) 


where  the  pjS  are  unitless  functions,  of  order  unity,  to  be 
determined.  Also  the  subscript  3 o  on  the  eigenvector  simply 
denotes  that  this  is  the  outer  solution  for  the  third  eigenvec¬ 
tor.  Inserting  (28)  into  ( 1 5)— ( 1 8)  and  taking  the  lowest  order 
in  v0v  for  each  equation,  one  readily  obtains  the  solution 


a 

lPbP*P3’P4<P5}  =  : - 

kv  0v 


n  n'  n  n2 

1 


+  0(1), 


(29) 


where  the  prefactor  is  the  amplitude  factor  that  comes 
from  the  WKB  first-order  correction  for  this  mode.  Then 
from  (28)  and  (29)  we  construct  the  outer  solution,  which  in 
lowest  order  is 


—  At 


n 

9  k.Vi 


0  V 

&vl 


a2,- i 


,ifya3dx 


ftUi 


Ov 


.tiyov 


(30) 


where  A$0  is  a  constant  which  we  will  specify  later. 

Next  we  address  the  outer  solutions  for  the  slow  modes. 
In  this  case  we  can  simply  set  u0y=0  in  ( 1 0)— ( 1 3)  and  pro¬ 
ceed  to  solve.  The  resulting  equations  are  the  same  as  the  rf 
equations  for  the  initiation  stage  except  that  (i)  there  is  no 
growth  rate  and  (ii)  we  can  no  longer  equate  dyvox  to  w0/O 
[see  (8)].  Taking  this  into  account  we  then  have: 


[  ,  «0  2k 

(k2  -  "o)2  2 

\-k  -  — —  +  — 
n  (oe  a 

fl2 

+ 


2X'X 

flay* 


(31) 


<V>l.v  = 


A' 

—vu.-ikv\x, 

A 


(32) 


where  A  is 


A  =  \2-  to2. 


(33) 


Once  solutions  are  obtained  for  vlx  and  ulv,  then  the  other 
components  can  be  obtained  by: 


/  2knQ  /1q  2X'X/i0\ 

A  a)e  tOfA  ) ' 


(34) 


<f>[  =  vlxCl-ivly<oe, 


(35) 


(x)e 

4>\=-V\x—  +  iv 
k 


(36) 


Since  the  quantities  vXx  and  ylv  continue  to  interact  on  the 
slow  scale,  it  is  not  possible  to  separate  these  two  slow 
modes  into  separate  eigenvectors.  Rather  we  should  simply 
represent  them  as  a  single  eigenvector  with  two  degrees  of 
freedom.  Thus  with  vXx  and  i7Iv  satisfying  (31)  and  (32),  we 
take  this  eigenvector  to  be 


[nl'Vlx*V\Y'<f>\,4>\]A5o  ~v\x 


0,1,  o,a- 


c i)e 

k 


+  u,v 


2kn()  Hq 


Aox  ~  n 


2X'X/i0 

o)gA 


,0,1, 

(37) 


where  the  subscript  45o  indicates  that  this  is  the  eigenvector 
for  the  4,5  slow  modes. 

The  above  solutions  are  accurate  to  the  next  order  in  d()v 
and  are  valid  solutions  providing  one  is  not  too  near  the 

diocotron  resonance.  Within  a  distance  of  around 

the  diocotron  resonance,  these  two  slow  modes  will  interact 
with  the  drift  mode.  To  match  these  two  outer  solutions  to 
the  inner  solutions,  we  shall  require  the  limit  of  these  solu 
tions  as  one  approaches  the  diocotron  resonance. 

To  obtain  this  limit  we  will  scale  the  variables  as  fol¬ 
lows.  We  take  our  scale  parameter  to  be  the  small  unitless 
quantity,  6,  defined  by 

e-  WoyJdil,  (38) 

w'here  we  will  take  the  value  of  y0v  (see  Fig.  5)  to  be  that  at 
the  diocotron  resonance.  The  scaled  coordinate,  z,  will  be 
defined  from  coe  by 

Z  =  -(aJil)/€,  (39) 

so  that  z=0  at  the  resonance.  Let  v  be  the  unitless  derivative 
of  the  parallel  dc  drift  velocity,  defined  by 

v-i }qj/£L  (40) 

then  we  have 

kv 

dy-—dz.  (41) 

Now  if  we  take 

A*=\/n,  (42) 


then  it  follows  that 
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n0  =  fl2(/u2  +  v-  1), 

(43) 

n'0  =  kCl2(  2nxix+vx), 

(44) 

k'  =  kCl/xx, 

(45) 

v'ox  =  kClvx, 

(46) 

and  so  on.  The  subscript  \  in  the  above  indicates  the  deriva¬ 
tive  with  respect  to  ky.  This  is  so  that  we  do  not  have  to 
convert  these  dc  quantities  into  functions  of  z  since  they  are 
supposedly  of  order  unity  functions  of  ky.  Whence  we  have 
introduced  the  variable  x=ty  order  to  have  a  unitless  de¬ 
rivative  of  order  unity  for  these  quantities. 

Let  us  comment  that  v  is  essentially  the  scaled  density 
profile  at  the  diocotron  resonance.  Since  this  resonance  usu¬ 
ally  occurs  at  or  near  the  Brillouin  edge,  we  can  expect  the 
derivative  of  v  to  become  large  but  not  so  large  as  to  become 
of  order  Me.  Similarly  since  (l  is  the  scaled  value  of  \, 
which  is  the  “effective'’  cyclotron  frequency  inside  the  reso¬ 
nance,  we  can  expect  its  derivative  to  be  generally  small. 

First  we  scale  the  two  slow  modes  and  the  equations  that 
they  satisfy,  (3 1 )  and  (32),  so  that  we  can  obtain  approximate 
solutions  for  and  v\y  near  the  resonance.  We  do  this  so 
that  we  will  have  functional  forms  for  the  solution,  required 
in  the  matching  process.  From  (38)— (46),  for  this  scaling,  the 
ODE’s  (31)  and  (32)  become 

Pzp3  +  e(i—  +  2^A+O(e2)  =  0.  (47) 

\  v  (XV  ) 

f>J>2  +  ip 3—  -  iepii  -  -  —7- )  +  0(e1)=  0,  (48) 

Z  \v  jx  j 

where  p2zz^r  and  are  unitless  functions  of  Z-  Also 

/3q  is  a  constant  coefficient.  It  and  two  other  coefficients  (to 
appear  later)  are  defined  as 


v  — 

A) -2 

'A  Yx 

(49) 

V 

n  v  ’ 

1 

(N 

II 

<5 

[.tx  Yx 

(50) 

V 

H  v  ' 

2v- : 

A  = 

[Yx 

(51) 

The  general  solution  of  (47)  and  (48),  through  order  6, 
may  be  verified  to  be 

=  C4 In  z2  + 0(6),  (52) 


p3  =  C5  -  (6C4-  -  leC^-1  -  eQ^-fln  z2  -  2)  +  Ofe2), 

V  (lv  2v 

(53) 

where  C4  and  C5  are  the  constants  of  integration.  From  these 
solutions  one  may  now  construct  the  eigenvectors  for  these 
slow  modes.  From  the  above  and  (34M46),  taking  the  eigen¬ 


vectors  to  be  the  appropriate  coefficients  of  C4  and  C5,  we 
obtain 


~  n  zfi  n2  n2z 

-  A)ft  >  T--  ie—> ~r -  <77 

k  kv  k  k  v 


(54) 


X(ln  z2  -  2)-  e2 


2ilz(xx 

kfJLV 


n2  ~  n2  , 

ln-  l  -  v)-utp{) 

2k  k  v 


ll2z  ^  ~  ll2r 

X^-( In  z2  -  2  +  2v)  -2ie1— — (1  -  v)(i 


2krv 


k  (iv 


(55) 


where  the  error  in  each  component  is  the  next  order  in  e. 
Eigenvector  4  corresponds  to  the  component  of  the  slow 
mode  where  the  potential  vanishes  at  the  resonance  while 
eigenvector  5  corresponds  to  the  component  where,  in  lowest 
order,  the  potential  is  nonzero  at  the  resonance.  Thus  we  will 
refer  to  these  two  modes  respectively  as  the  “potential  gra¬ 
dient  mode”  and  the  “potential  mode”  due  to  their  structure 
at  this  resonance. 

Upon  taking  the  same  limit  for  the  third  outer  eigenvec¬ 
tor,  one  obtains 


e2  O 
k'lz2  k' 


eft2 
1 Z  k  ' 


z2  k2 


eify<*  3^ 


(56) 


where  we  have  taken  A^^e2  so  that  the  largest  component 
of  this  eigenvector  would  be  of  order  unity. 


V.  THE  INNER  SOLUTIONS  AT  THE  DIOCOTRON 
RESONANCE 

To  construct  inner  solutions  we  start  with  the  scalings  for 
the  quantities  <f>\\  as  given  by  (20)  and  (21)  and 

by  (38)-(46).  Thus  we  take 

V=[il2il/heihl/2/k,e1nil/^/ky€H2iJ/4/k,€2il2iJ/^/k2].  (57) 

where  the  ips  are  other  unitless  functions  of  z  to  be  deter¬ 
mined.  Substituting  the  above  into  ( 1 5)— ( 1 8)  and  using  (41) 
to  convert  y  derivatives  to  z  derivatives  as  needed,  one  ob¬ 
tains,  through  first  order  in  e,  the  five  scaled  equations 

A'/'i  =  *( <h  “  77(  1  -  n2  -  v)  -  eip3( 2/xxn  +  vx) 


+  ('A»-'/'2)(l  -P2-  i')/62  +  0(e2),  (58) 

i =  -  iz'fo  +  1  -  v)  +  ii/f5  +  0(e2),  (59) 

vr)zip3  =  -  izipi  +  (ip4~  tpzVe1  +  Oie2),  (60) 

vdzipA  =  ipi  +  Oie2),  (61) 

vPztp5=ip4.  (62) 


The  last  equation,  being  the  definition  of  </r4,  is  exact  to  all 
orders. 
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These  equations  reduce  to  a  third-order  system.  Starting 
with  (58)  and  (60),  one  deduces 

04=  <1*2+  e2Pi  +  0(e4),  (63) 

where  p4  is  another  function  of  z  to  be  determined.  From 
(59)  and  using  (63)  to  eliminate  04  from  (61),  one  obtains 

0i  =  -/z02  +  03(  1  -  i>) +  /05  +  0(62).  (64) 

Using  this  to  eliminate  0i  from  (58),  we  have 

p4  -  3  -  02 )  “  2 +  6K62).  (65) 

Then  upon  eliminating  p4  from  (60),  it  follows  from  (62)  that 

05  =  /03  +  C5  +  €p$  +  0(f2),  (66) 

where  C5  is  a  constant  and  p5  is  another  unknown  function  of 
Z.  From  (60)  we  obtain  02 

02  =  /  ^03  +  2/603/X^//^  +  Off2).  (67) 

Lastly  we  use  the  above  to  eliminate  all  variables  except  03 
and  from  (59)  and  (62),  to  obtain  the  third-order  system 

^<^03  +  [fz  +  y)]^03  +  lieztfojix/fi 

>6P5=C5+0(62),  (68) 


*>^5  =  2il/f3fJLxl/JL  +0(6). 


(69) 


The  solution  of  (68)  and  (69)  for  03  and  p5  may  be 
verified  to  be 

•  2 

Ps  =  Ci—/xx  +  Cj[izel9+  (/  -  zV^F,]-^*-  -  C5-^  +  O(e), 

(70) 
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where  C3  and  C4  are  additional  constants,  F,  is  the  complex 
conjugate  of  Fx  and 


6  = 


-dz. 

(72) 
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el0dz , 

) 

(73) 

eieF\dz. 

(74) 

In  lowest  order  F,  is  a  Fresnel  integral.  The  other  0s  may  be 
obtained  from  the  above  and  (63)— (67). 

With  this  we  have  three  linearly  independent  inner  solu¬ 
tions  which  correspond  to  the  drift  mode  and  the  two  slow 
modes.  Letting  the  independent  solutions  be  the  coefficients 
of  C3,  C4,  and  C5,  we  have 
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where  the  subscript  i  refers  to  the  inner  solution.  In  the  next 
section  we  will  match  the  inner  and  outer  solutions  and  de¬ 
termine  the  connection  coefficients. 


VI.  THE  CONNECTION  COEFFICIENTS 


In  order  to  match  the  inner  solutions  to  the  outer  solu¬ 
tions,  we  need  the  asymptotics  of  the  inner  solutions  in  the 
limit  of  z  large.  For  this  we  need  the  asymptotics  of  ft  Ft  and 
F2.  One  may  easily  verify  from  (38)-(40)  that  the  asymptotic 
of  6  for  large  z,  up  to  an  additive  constant,  is  just  the  phase 
of  the  drift  mode,  fya3dy.  The  asymptotics  of  Fl  will  be 
required  to  first  order  in  6.  Thus  it  will  be  necessary  to  in¬ 
clude  the  x  dependence  of  v  in  the  evaluation  of  Fx.  How¬ 
ever  the  asymptotics  of  F2  will  only  be  required  to  zeroth 
order  in  6,  in  which  case  we  may  ignore  the  variation  of  v 
with  respect  to  \- 

As  shown  in  the  Appendix,  in  the  limit  of  |z|  large,  the 
asymptotics  of  these  integrals  are: 
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where  is  the  value  of  v  at  the  resonance  (z=0)  and  y 
=0,57721...  is  Euler's  constant. 

Let  us  evaluate  the  connection  coefficients  in  the  follow¬ 
ing  manner.  Start  with  a  linear  superposition  of  the  three 
outer  solutions  below  the  resonance  (z<0)  of  the  form 

V = c(3-V3(,  +  c<-V4„  +  dfV5„,  (80) 

where  Cj’1  is  the  amplitude  of  the  mh  eigenvector  and  the 
superscript  (-)  indicates  that  this  is  the  amplitude  below  the 
resonance.  We  take  the  form  of  the  interior  solution  to  be 

V=D3V3i  +  DAVA!  +  D5V5i,  (81) 

where  Dn  is  the  amplitude  of  the  mh  interior  solution.  Then 
upon  inserting  (78)  and  (79)  into  (75)— (77),  for  z  large  and 
negative,  and  upon  comparing  the  result  with  (54M56),  one 
finds  that  in  lowest  order  of  e, 
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Repeating  the  same  for  z  large  and  positive,  we  take  the 
outer  solution  to  be  the  linear  superposition 


v=d,+V3o+d+V4„+c^V. 


(86) 


where  the  superscript  (+)  indicates  that  this  coefficient  is  for 
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Combining  (82)— (84)  with  (87)— (89),  upon  eliminating 
the  Ds  we  then  have 


cf  =  C(3->  +  (3oC{5-\2TTVoe‘nM  +  0(e). 
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(91) 

(92) 


Some  physical  understanding  to  these  results  can  be  ob¬ 
tained  by  noting  the  following  points.  Recall  that  J3(h  (49),  is 
essentially  the  negative  of  the  logarithmic  derivative  of  the 


density  with  respect  to  and  as  such,  it  is  only  significantly 
nonzero  at  the  edge  of  the  sheath.  Away  from  the  edge  we 
may  take  fi0  to  be  essentially  zero.  Thus  when  p0^  0,  we  see 
that  the  amplitudes  of  the  drift  mode  and  potential  mode 
would  be  transmitted  across  the  resonance  without  change. 
Meanwhile  the  potential  gradient  mode  would  always  be 
modified,  picking  up  an  additional  part  proportional  to  the 
amplitude  of  the  drift  mode.  On  the  other  hand  when  the 
resonance  is  near  the  edge  such  that  fi0  is  significantly  non¬ 
zero,  then  the  potential  mode  always  modifies  the  other  two 
modes,  although  itself  is  transmitted  across  the  resonance 
unchanged. 

Let  us  review  the  nature  of  these  three  modes  in  the 
outer  region  (\z\  >  1).  From  (56)  we  see  that  the  drift  mode 
basically  only  has  fast  density  oscillations  as  long  as  z  is  in 
the  outer  region.  The  potential  gradient  mode,  (54),  has  a  rf 
density  “bulge"  at  the  resonance,  on  the  slow  scale,  propor¬ 
tional  to  /70.  It  also  has  a  slow  rf  potential  gradient  and  es¬ 
sentially  a  vanishing  value  for  the  potential  at  the  resonance. 
On  the  other  hand  the  potential  mode,  (55),  is  more  complex, 
but  basically  is  the  only  mode  with  a  nonzero  value  for  the  rf 
potential  at  the  resonance. 

Now  consider  the  following.  Potentials  are  more  difficult 
to  shift  than  potential  gradients,  which  could  then  be  taken  as 
an  explanation  for  why  the  potential  mode  tends  to  be  so 
resilient  and  transmitted  across  the  resonance  unchanged.  (To 
justify  this  explanation,  note  that  without  work  being  done, 
one  can  shift  the  charge  distribution  inside  the  plasma  such 
that  one  shifts  the  potential  gradient  but  not  the  overall  po¬ 
tential.)  Consider  when  we  have  an  absence  of  a  density 
gradient  at  the  resonance  and  thus  /Sq^O.  The  drift  mode, 
containing  essentially  only  fast  density  oscillations,  would 
tend  to  simply  transmit  unchanged  across  the  resonance. 
However  as  this  drift  mode  crosses  the  resonance,  since  the 
WKB  wave  vector  a ^  will  cross  zero  here,  its  fast  oscilla¬ 
tions  will  become  slower  and  slower  and  eventually  reverse. 
Thus  at  the  resonance,  this  mode  would  leave  behind  a  non¬ 
zero  average  for  the  rf  potential  gradient,  thereby  contribut¬ 
ing  to  the  potential  gradient  mode.  On  the  other  hand  since 
its  density  average  is  zero  it  would  not  significantly  contrib¬ 
ute  to  the  potential  mode.  Now  let  the  resonance  be  located 
near  the  edge  of  the  sheath  so  that  /?()  =£  0.  Here  we  have  an 
additional  term  coming  into  play  which  is  the  coupling  pro¬ 
vided  by  the  dc  density  gradient  at  the  edge  of  the  sheath  as 
in  (52)  and  (53).  This  dc  density  gradient,  combining  with 
the  potential  mode,  then  generates  contributions  to  both  the 
drift  mode  and  the  potential  gradient  mode. 

VII.  CONCLUSIONS 

We  have  shown  that  in  the  saturation  stage  the  rf  com¬ 
ponent  of  the  non-neutral  plasma  inside  a  magnetron  be¬ 
comes  a  fifth-order  system  This  fifth-order  system  consists 
of  two  slow  modes,  two  fast  drift-cyclotron  modes  (Slater 
orbits)  and  one  fast  drift  mode.  The  three  fast  modes  are  not 
found  in  the  initiation  stage  since  their  existence  requires  the 
dc  drift  velocity  to  have  a  component  directed  from  the  cath¬ 
ode  to  the  anode.  Although  the  current  generated  by  this 
vertical  dc  velocity  is  only  a  small  fraction  of  the  total  cur- 
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rent  circulating  in  the  device,  it  is  exactly  this  smallness 
which  creates  a  dramatic  change  in  this  structure  of  the  rf 
solutions.  In  particular  this  vertical  dc  current  causes  reso¬ 
nance  layers  to  occur  at  the  position  of  any  resonance  in  the 
plasma,  inside  of  which  mode  conversions  between  these 
five  modes  can  occur.  These  resonances  were  already  in  the 
initiation  stage:  (1)  the  diocotron  resonance  at  cje=0 ,  and  (2) 
the  two  cyclotron  resonances  at  cu,=  ±0.  The  structure  of 
these  resonances  occurs  due  to  the  small  parameter  (vertical 
velocity)  multiplying  three  of  the  derivatives  of  the  fifth- 
order  set  of  ODEs.m 

As  we  have  already  seen  in  Fig.  6  these  fast  modes  pre¬ 
vent  one  from  obtaining  rapid  numerical  solutions  in  this 
stage.  Now  that  the  structure  and  location  of  these  reso¬ 
nances  are  known,  and  the  structure  and  widths  of  the  reso¬ 
nance  layers,  it  will  be  possible  to  develop  numerical  code  to 
rapidly  solve  this  fifth-order  system.  For  example  if  there 
was  no  magnetron  resonance  inside  the  plasma  then  the  drift- 
cyclotron  modes  never  cross  the  other  resonances.  In  this 
case  the  fifth-order  system  can  be  reduced  to  a  third -order 
system  and  two  first-order  systems  resulting  in  a  very  signifi¬ 
cant  decrease  in  the  required  computational  time  for  the 
drift-cyclotron  modes.  In  effect  what  one  can  do  is  to  use  the 
WKB  approximation  (followed  by  a  perturbation  expansion) 
to  separate  the  fifth-order  system  into  three  first-order  ODE’s 
and  one  second-order  ODE,  each  of  which  could  be  rapidly 
computed  since  the  fast  variations  could  be  analytically  sepa¬ 
rated  out.  The  only  problem  would  be  in  the  region  of  any 
resonances  where  one  would  have  to  appropriately  couple 
the  ODEs  and  carefully  numerically  integrate  through  the 
resonance.  However  such  can  be  done:  one  can  separately 
propagate  each  fast  mode  from  the  cathode  until  it  either 
reaches  the  anode  or  crosses  the  two  slow  modes.  When  the 
latter  occurs  one  simply  does  a  separate  computation  around 
the  resonance. 

Then  there  is  the  question  of  boundary  conditions  on  the 
rf  fields.  In  the  initiation  stage  one  only  requires  two  bound¬ 
ary  conditions:  (i)  the  vanishing  of  the  rf  potential  at  the 
cathode  and  (ii)  specifying  the  magnitude  of  the  rf  density 
oscillations  at  the  anode.  The  former  follows  whenever  the 
cathode  is  a  smooth  conductor  and  the  latter  is  generally 
determined  by  the  dc  density  profile.  (Note  that  since  the 
electron  density  can  never  be  negative,  the  amplitude  of  the 
rf  density  oscillations  must  always  be  less  than  the  dc  elec¬ 
tron  density."0  Taking  these  to  be  equal  has  the  effect  of 
coupling  the  rf  amplitude  at  the  anode  to  the  dc  amplitude  at 
the  anode.)  For  the  saturation  stage  three  additional  boundary 
conditions  will  be  required.  It  seems  reasonable  that  one 
should  take  these  three  conditions  to  be  the  vanishing  of  all 
fast  modes  at  the  cathode.  This  would  follow  from  recent 
results 10-13  on  the  stability  of  various  non-neutral  flows. 
However  it  should  also  be  mentioned  that  various  recent 
methods  for  breaking  up  the  laminar  flow  at  the  cathode,  as 
in  Ref.  27,  could  be  found  to  be  a  method  for  creating  non¬ 
zero  sources  of  fast  modes  at  the  cathode. 

In  the  usual  situation  of  a  smooth  cathode  with  no  mag¬ 
netron  resonance  present,  the  rf  structure  in  the  saturation 
stage  becomes  simpler  since  only  the  diocotron  resonance 
exists.  Taking  the  fast  modes  to  be  zero  at  the  cathode,  it 
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follows  that  both  drift-cyclotron  modes  will  be  absent.  Also 
the  drift  mode  will  be  zero  between  the  cathode  and  the 
location  of  the  diocotron  resonance.  Now  we  apply  (90)-(92) 
and  find  that  above  the  diocotron  resonance  the  drift  mode 
will  exist  with  /SqC^.  Thus  the  total  rf  field  at  the 

anode  would  then  consist  of  a  drift  mode  and  two  slow 
modes  and  the  ponderomotive  terms  in  (6)-(9)  would  consist 
of  two  parts:  the  usual  contribution  from  the  slow  modes  and 
an  additional  contribution  from  the  drift-cyclotron  mode  cre¬ 
ated  at  the  diocotron  resonance  from  the  slow  modes. 

How  important  are  these  three  fast  modes  to  the  genera¬ 
tion  of  the  rf  power  of  a  magnetron?  At  the  moment  we 
cannot  assess  this  quantitatively  since  we  still  are  to  obtain 
self-consistent  saturation  stage  solutions  for  dc  and  the  rf 
fields  [i.e.  satisfying  (6)— ( 1 3)].  We  do  know  that  the  growth 
rate  vanishes  when  the  device  is  in  the  saturation  stage.  We 
do  know  that  if  we  start  with  the  dc  field  of  the  initiation 
stage  that  we  obtain  the  rf  solutions  discussed  herein.  We 
would  expect  the  dc  solution  of  the  saturation  stage,  satisfy¬ 
ing  (6)-(9),  to  be  qualitatively  similar  to  that  of  the  initiation 
stage  as  given  in  Figs.  3-5.  We  do  know  that  these  devices 
generally  operate  only  when  the  diocotron  resonance  occurs 
at  or  near  the  middle  of  the  density  profile  and  where  the 
density  gradient  is  large.  We  do  know  that  if  the  parameters 
are  outside  this  range  then  the  device  will  not  operate.  In  the 
latter  case  1  and  per  (90M92),  essentially  no  drift  mode 
would  be  produced.  What  we  do  not  know  is  the  range  of 
values  for  /30  in  (90).  One  can  argue  from  (30)  and  (37)  that 
for  1,  the  amplitude  of  the  rf  electric  field  of  the  drift 
mode  at  the  anode  could  even  exceed  that  of  the  slow  modes. 
It  all  depends  on  the  sharpness  of  the  edge  of  the  electron 
sheath  in  the  saturation  stage.  All  these  points  are  also  con¬ 
sistent  with  the  statement:  a  magnetron  will  only  operate 
when  a  significant  drift  mode  has  been  generated  at  the  dio¬ 
cotron  resonance  from  the  slow  modes.  Thus  it  certainly  ap¬ 
pears  that  the  drift  mode  is  a  key  component  in  the  operation 
of  a  magnetron.  We  will  understand  more  about  this  after 
self-consistent  solutions  for  the  saturation  stage  are  obtained 
following  along  the  lines  of  the  self-consistent  calculations 
done  for  the  initiation  stage  9 

In  this  paper  we  have  detailed  the  five  independent  so¬ 
lutions  in  the  region  of  the  diocotron  resonance.  We  have 
determined  the  conversion  coefficients  for  the  three  modes 
concerned  and  have  given  a  general  explanation  for  their 
structure.  In  particular,  the  amplitude  of  the  potential  mode 
remains  unaffected  in  crossing  the  resonance  while  the  drift 
mode  and  the  potential  gradient  mode  pick  up  additional 
contributions  to  their  amplitudes  from  the  potential  mode 
with  the  coupling  being  proportional  to  the  dc  density  gradi¬ 
ent.  The  drift  mode  will  always  add  a  contribution  to  the 
potential  gradient  mode  due  to  the  break  in  the  potential 
gradient  of  the  drift  mode  at  the  resonance. 

In  summary,  we  have  studied  the  nature  of  the  rf  solu¬ 
tions  of  the  steady-state  magnetron  when  it  is  steadily  deliv¬ 
ering  rf  power,  particularly  the  rf  modes  in  the  non-neutral 
plasma  in  the  neighborhood  of  the  diocotron  resonance.  We 
have  found  that  in  this  neighborhood  the  ordinary  differential 
equations  for  the  rf  fields  are  nonsingular  and  that  nonsingu¬ 
lar  solutions  do  exist  for  the  rf  quantities.  This  resonance 
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FIG.  9.  The  contour  in  the  complex  plane  for  evaluating  F,(z)  in  the  limit  of 
large  z. 


layer  is  relatively  thin  and  in  general  the  rf  fields  will  rapidly 
vary  as  they  pass  through  the  resonance  in  a  manner  similar 
to  what  occurs  at  boundary  layers.  During  the  passage  of  the 
slow  modes  and/or  the  drift  mode  through  this  resonance, 
conversion  from  one  mode  to  another  will  occur  with  the 
conversion  coefficients  being  given  by  (90)-(92). 
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APPENDIX:  ASYMPTOTIC  LIMITS  OF  INTEGRALS 


=0,  and  the  value  of  the  integral  is  then  found  to  be 
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where  v0  is  the  value  of  v  at  the  center  of  the  resonance. 
Since  F j  will  be  the  negative  of  the  sum  of  these  two  inte¬ 
grals,  we  have  that  in  the  limit  where  z  is  large  and  positive. 
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To  evaluate  the  limit  of  F2(z)  for  large  |z|,  we  only  need 
the  limit  to  order  unity  and  thus  we  take  v  to  be  a  constant.  In 
this  case,  from  (74)  and  (Al),  F2(z)  may  be  given  by 


(A5) 


This  double  integral  may  be  evaluated  by  transforming  to  a 
hyperbolic  coordinate  system.  Let  us  take  u=w  cosh  if  and 
v-w  sinh  ip.  Then  the  above  integral  can  be  transformed  into 

W  =  /,(z)  +  /2(z),  (A6) 

where 


In  this  Appendix  we  will  derive  the  asymptotic  limits  of 
F|  and  F2  for  |z|  — 

Starting  with  F(,  note  that  we  will  need  this  result  to 
order  e.  Thus  we  cannot  just  simply  replace  F]  by  a  Fresnel 
integral,  but  we  must  consider  the  first-order  variations  of  v 
with  respect  to  x •  As  long  as  v  does  not  significantly  change 
over  the  region  where  |z|  <  1  /  Ve,  we  may  consider  v  to  be 

an  analytic  function  of  z  and  use  a  contour  integral  to  evalu¬ 
ate  the  asymptotic  value  of  Fx.  For  clarity  we  introduce 
dummy  variables  w  and  u  and  rewrite  (72)  and  (73)  as 

(Kz)  =  f  F,(w)=  f  emdz.  (Al) 

Jo  Kw)  J0 


Consider  the  integral  of  Je'^dz  around  the  contour 
shown  in  Fig.  9  for  w  large  and  positive.  It  can  be  seen  that 
F|(w)  is  the  integral  along  the  real-axis  segment  of  the  con¬ 
tour.  By  assumption  6  will  be  analytic  inside  the  contour. 
Thus  there  will  be  no  residue  and  the  value  of  the  contour 
integral  will  be  zero.  Thus  the  value  of  the  real-axis  segment 
will  be  the  negative  of  the  contributions  along  the  other  two 
segments. 

Along  the  arc,  the  major  contribution  to  the  integral  will 
come  from  the  region  of  the  arc  next  to  the  real  axis.  To 
evaluate  the  integral  along  this  arc  we  expand  (Kz )  in  a  Tay¬ 
lor  series,  expanded  about  z=w.  Then  evaluation  of  this  in¬ 
tegral  gives 


i  €l/y  A  1 

--~f +0  — 

w  w  \  w 


eitKw)  +  Oie2). 


(A2) 


Along  the  ray  the  major  contribution  to  that  segment  of  the 
contour  integral  will  come  from  the  region  next  to  the  origin. 
Here  we  expand  0(z)  in  a  Taylor  series,  expanded  about  z 


P  ziiid  ( z\ 

z)  =  J  we  2v  ln(  —  Jdw , 

p 

z)  -  we  2v 

J  o 


ln(  1  +  \j\  -  —  jdw . 


(A7) 


(A8) 


Integrating  by  parts  on  the  first  integral,  to  eliminate  the 
ln(z/w)  term  gives 


(A9) 


where  the  last  integral  is  listed  in  Ref.  33.  In  the  limit  of 
large  \z\  this  gives 


/,(z) 


(A10) 


up  to  exponentially  small  terms.  In  the  above,  y 
=  0.57721...  is  Euler’s  constant. 

For  /2,  we  first  transform  variables  from  w  to  p  where 
w2=z2(  1  -p2)  and  then  we  perform  an  integration  by  parts  to 
eliminate  the  In  term.  This  gives 

. ,2  ri  ei&£ 

I2(z)  =  -  iv\n  2  +  ive'tl  I  — —dp.  (All) 

Jo  1  +P 


The  major  contribution  to  the  remaining  integral  comes  from 
the  region  near  p= 0,  so  we  replace  j—  by  1  under  the 
integral.  When  that  is  done,  the  first  term  becomes  a  Fresnel 
integral  and  can  be  evaluated  as  with  Fx(z)  above.  The  sec¬ 
ond  term,  under  the  integral  can  be  shown  to  be  bounded 
by  an  order  of  1  fz 2  for  large  \z\.  Thus  we  have 
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I2(z)  =  -i  v 


7 TV  /  1 

In  2-  \I  —e  2ve  4  +  0\  — 


(A  12) 


Combining  the  above  results  for  (A  10)  and  (A  12)  into 
(A6)  gives 
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for  \z\  large. 
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Abstract 

Variational  problems  have  long  been  used  to  mathematically  model  physical  systems.  Their  advantage  has  been  the  simplicity  of  the  model  as 
well  as  the  ability  to  deduce  information  concerning  the  functional  dependence  of  the  system  on  various  parameters  embedded  in  the  variational 
trial  functions.  However,  the  only  method  in  use  for  estimating  the  error  in  a  variational  approximation  has  been  to  compare  the  variational  result 
to  the  exact  solution  In  this  Letter,  we  demonstrate  that  one  can  computationally  obtain  estimates  of  the  errors  in  a  one-dimensional  variational 
approximation,  without  any  a  priori  knowledge  of  the  exact  solution.  Additionally,  this  analysis  can  be  done  by  using  only  linear  techniques.  The 
extension  of  this  method  to  multidimensional  problems  is  clearly  possible,  although  one  could  expect  that  additional  difficulties  could  arise 
©  2007  Elsevier  B  V.  All  rights  reserved 


1.  Introduction 

In  the  theory  of  modem  dynamics,  there  exist  two  para¬ 
digmatic  views  on  formulating  the  mathematical  model  for  a 
physical  system.  These  paradigms  arrive  at  a  completely  equiv¬ 
alent  result.  The  first  perspective,  commonly  viewed  as  the 
Newtonian  approach,  argues  that  the  evolution  of  the  state  of 
a  process  can  be  completely  described  in  terms  of  the  forces 
involved  with  the  process.  This  approach  is  a  manifestation  of 
the  Aristotelian  “cause  and  effect”  view  of  the  universe.  The 
mathematical  framework  for  the  second  perspective  ts  based 
on  two  publications  [1,2]  in  the  1830s  by  William  Hamilton. 
Within  these  papers,  Hamilton  proposes  a  theory  of  dynamics 
which  describes  all  of  classical  (Newtonian)  mechanics  without 
the  use  of  forces.  Instead*  the  physical  process  under  consider¬ 
ation  will  evolve  in  such  a  way  as  to  extremize  the  integral  of 
the  difference  between  the  kinetic  and  potential  energies.  Philo¬ 
sophically,  this  is  in  stark  contrast  to  the  Newtonian  perspective. 
Instead  of  viewing  the  evolution  of  a  process  as  a  result  of  ex- 
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temal  influences,  the  Hamiltonian  approach  suggests  there  is  an 
internal  intent  on  the  part  of  the  process  to  evolve  in  a  particular 
manner. 

Variational  problems  have  existed  in  mathematics  through¬ 
out  history.  The  earliest  that  we  have  located  was  in  the  Aeneid 
[3],  where  there  is  a  legend  of  a  woman  who  fled  to  Africa  af¬ 
ter  her  husband  was  murdered.  She  arrived  somewhere  along 
the  North  African  coast  where  she  pleaded  with  the  local  ruler. 
King  Jambus,  for  land.  The  King  granted  the  woman  as  much 
land  as  could  be  enclosed  with  the  hide  of  a  bull.  Legend  says 
that  the  clever  lady  cut  the  hide  into  very  thin  and  long  strips 
and  laid  it  out  to  enclose  the  maximum  possible  area  (a  circle). 
The  land  later  became  the  city  of  Carthage  (c.  814  BC)  and 
the  woman  its  ruler.  Queen  Dido.  While  legend  and  fact  may 
not  necessarily  agree,  this  illustrates  that  the  idea  of  extremum 
principles  has  existed  for  millenia.  Eight  centuries  later.  Hero  of 
Alexandria  (c.  10-70)  proved  the  first  recorded  scientific  min¬ 
imum  principle.  He  was  able  to  show  that  the  trajectory  of  a 
reflected  light  ray  is  a  minimum  if  the  angles  of  incidence  and 
reflection  are  equal  [4].  This  idea  was  later  formulated  as  a  least 
time  principle  by  Fermat  in  the  early  part  of  the  seventeenth 
century  [4]. 

The  mathematical  machinery  necessary  to  investigate  such 
problems  was  found  to  require  more  than  the  elementary 
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calculus  developed  by  Newton  [5]  and  Leibniz  in  the  mid¬ 
seventeenth  century.  In  1696,  Johann  Bernoulli  proposed  "... 
brachistochrone  problem  which  can  be  summarized  as  ‘ //  two 
points  are  connected  by  a  wire  whose  shape  is  given  by  an  un¬ 
known  function  y(jc)  in  a  vertical  plane ,  what  shape  function 
minimizes  the  time  of  descent  of  a  bead  sliding  without  friction 
from  the  higher  to  the  lower  point  *  ”  [6].  This  problem  was  ad¬ 
dressed  by  several  of  Bernoulli’s  contemporaries,  and  a  new 
type  of  calculus  arose  from  those  investigations.  This  calculus 
is  known  today  as  the  calculus  of  variations  and  was  devel¬ 
oped  into  a  full  mathematical  theory  by  Euler  around  1744  [7]. 
The  mathematics  developed  by  Euler  was  extended  by  Joseph- 
Louis  Lagrange  (1736-1813).  Lagrange  discovered  that  Euler’s 
equation  (later  to  be  named  the  Euler-Lagrange  equation)  for 
minimizing  a  functional  integral  could  be  expressed  in  a  com¬ 
pact  way  by  simply  using  integration  by  parts.  It  was  Lagrange 
who  introduced  the  integrand  of  the  functional  appropriate  to 
mechanics,  i.e.,  the  difference  between  kinetic  and  potential  en¬ 
ergies.  Euler  had  essentially  only  considered  the  kinetic  energy 
[8].  This  approach  amounted  to  requiring  additional  conditions 
to  get  a  complete  formulation  of  classical  mechanics. 

Since  the  solution(s)  of  a  dynamical  system  is  one  which 
extremizes  this  quantity  called  the  action,  it  therefore  follows 
from  this  principle  that  one  can  also  devise  methods  for  find¬ 
ing  approximate  solutions.  Thus,  one  can  reduce  the  number 
of  degrees  of  freedom  in  which  the  function  can  be  varied  and 
Hamilton’s  Principle  will  lead  one  to  a  solution  which,  in  some 
sense,  will  be  as  close  as  possible  to  the  correct  solution.  Due  to 
the  reduction  in  the  number  of  degrees  of  freedom,  one  cannot 
generally  expect  the  function  to  achieve  the  actual  extremum  of 
the  full  problem,  “...these  methods  aim  to  “ project ”  the  full 
dynamics  onto  a  finite-mode  space,  or,  in  other  words,  truncate 
a  system  with  infinitely  many  degrees  of  freedom  to  a  finite • 
dimensional  one ”  [9].  However,  one  can  still  expect  to  find 
an  extremum  which  should  be  “near”  the  exact  solution.  This 
method  has  long  been  used  in  various  applications  ranging  from 
geometry  [10]  to  quantum  mechanics  [11,12].  Tn  particular,  it 
was  in  this  period  that  we  have  the  “Raleigh-Ritz  method”  be¬ 
ing  developed  independently  by  Raleigh  and  Ritz,1  for  finding 
eigenfunctions  and  eigenvalues  of  linear  differential  equations. 
This  method  is  known  in  modem  mathematical  literature  as  the 
Raleigh-Ritz  method  [13,14].  In  fact,  this  usage  of  variational 
methods  continued  into  the  first  half  of  the  20th  century,  and 
even  up  into  the  late  1980’s.  During  this  time,  quantum  me¬ 
chanics  and  all  of  modem  physics  were  bom  and  with  these, 
there  came  a  critical  need  to  be  able  to  obtain  numerical  values 
to  compare  with  experiments.  Up  until  the  1950s  there  were 
no  electronic  computers;  even  into  the  latter  half  of  the  20th 
century,  such  electronic  devices  were  generally  only  available 
at  large  government  labs  or  universities  where  their  usage  was 
generally  quite  limited.  Otherwise,  one  only  had  slide-rules  or 


1  The  authors  were  not  able  to  identify  historical  records  of  the  work  done 

by  Raleigh  and  Ritz,  only  that  the  name  of  the  method  seemed  to  have  been 
adopted  sometime  during  the  late  19th  Century,  which  was  the  time  period  in 
which  Raleigh  and  Ritz  performed  their  work. 


crude,  electric-operated,  mechanical  calculators  for  use.  Thus, 
there  was  a  definite  need  for  the  development  of  methods  to 
obtain  some  type  of  approximate  solutions  consistent  with  the 
constraints  imposed  by  these  computational  devices.  Conse¬ 
quently  the  power  of  analytical  methods  of  mathematics  was 
put  into  use  with  these  devices.  As  an  answer  to  this  need,  there 
were  major  efforts  to  utilize  variational  methods  for  develop¬ 
ing  approximate  solutions,  as  well  as  perturbation  expansions  of 
such,  for  various  physical  systems  [15].  We  note  that  in  this  time 
period  nonlinear  problems  were  generally  not  studied  as  such, 
except  to  the  degree  that  one  could  expand  about  some  solv¬ 
able  linear  problem,  or  some  known  analytical  solution.  Thus, 
it  is  not  surprising  that  all  the  work  in  this  period  mainly  con¬ 
centrated  on  linear  eigenvalue  problems  and  their  perturbations, 
with  variational  methods  playing  leading  roles. 

The  need  for  this  development  of  variational  methods  began 
to  decline  after  the  1970s;  the  availability  of  calculators  and 
computers  and  particularly  the  advent  of  the  personal  computer 
and  the  accompanying  increase  in  computational  power  that 
came  with  these  devices  and  their  larger  brothers  This  marked 
the  end  of  an  era  and  the  beginning  of  a  new  one.  Within  a 
decade,  it  would  no  longer  be  important  to  carefully  and  ana¬ 
lytically  expand  solutions  of  equations  of  motion,  in  order  to 
obtain  a  three-to-four  place  accuracy  in  their  numerical  values, 
when  with  the  touch  of  a  few  keys,  the  value  desired  would  ap¬ 
pear  almost  instantly  and  with  an  8-to-14  place  accuracy. 

Nevertheless,  even  though  general  computations  no  longer 
needed  to  utilize  variational  methods,  there  have  always  been 
other  aspects  of  variational  methods  which  current  computa¬ 
tional  power  cannot  hope  to  match.  In  particular,  variational 
methods  have  always  been  adept  at  providing  functional  de¬ 
pendences  and  trends,  and  thereby  a  single,  simple  variational 
result  could  contain  more  information  than  dozens  of  pages 
of  numerical  computations.  For  example,  when  a  nonlinear 
problem  contains  three  parameters  and  is  also  a  differential 
system  of  twelfth-order,  one  would  like  to  be  able  to  local¬ 
ize  (as  much  as  possible)  the  parameter  regime  wherein  one 
would  need  to  search  for  soliton  solutions.  This  can  be  done 
with  variational  methods  [16].  Furthermore,  variational  tech¬ 
niques  developed  in  numerical  methods  (such  as  the  Galerkin 
method)  have  also  been  applied  to  develop  variational  meth¬ 
ods  for  dissipative  systems  [17]  and  also  to  study  the  parame¬ 
ter  space  of  electron  devices  [18].  Indeed,  variational  methods 
have  been  used  to  find  approximate  solutions  in  various  cases, 
particularly  in  cases  where  closed-form  solutions  would  not 
generally  exist  [16,17,19-23].  The  most  recent  comprehensive 
review  of  current  applications  and  successes  of  the  variational 
method  has  been  given  by  Malomed  [9]  and  is  particularly  rec¬ 
ommended  as  a  reference  on  variational  methods.  However, 
in  spite  of  the  successes  that  have  been  achieved,  variational 
methods  can  also  lead  one  to  uncertain  or  wrong  results  [18, 
22,24]2  and  thus,  there  is  also  a  need  for  an  even  deeper  and 


-  We  note  that  Ref.  [24]  had  used  the  variational  approximation  to  make  pre¬ 
dictions  on  the  stability  of  gap  solitons.  and  had  found  that  all  models,  including 
the  integrable  Massive  Thirring  model,  contained  parameter  regimes  in  which 
the  solitary  wave  solutions  were  unstable.  Integrable  systems  are  not  normally 
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more  through  understanding  of  these  methods.  In  particular, 
there  is  a  need  to  be  able  to  discern  when  a  variational  method 
is  not  “sufficiently  close”  to  serve  as  an  adequate  approxima¬ 
tion. 

At  this  point,  let  us  define  some  nomenclature.  By  “Hamil¬ 
ton’s  Principle”,  we  mean  the  general  principle  which  states 
that  the  equations  of  interest  can  be  obtained  by  the  extrem- 
ization  of  some  functional  quantity,  called  the  “action”  The 
equations  obtained  are  called  the  Euler-Lagrange  equations. 
The  action  is  obtained  by  integrating  (summing)  over  an  appro¬ 
priate  Lagrangian  By  “variational  method”  we  mean  a  method 
that  makes  use  of  Hamilton’s  Principle  in  some  manner.  When 
one  uses  Hamilton’s  Principle  to  find  approximate  solutions, 
we  shall  refer  to  the  solution  found  as  the  “variational  approxi¬ 
mation”  (VA).  We  note  that  the  Raleigh-Ritz  method  also  uses 
Hamilton’s  Principle  to  obtain  a  VA.  The  term  VA  is  intended 
to  be  an  inclusive  term  and  to  include  applications  to  nonlin¬ 
ear  systems  as  well  More  general  variational  approaches,  such 
as  those  used  in  Ref.  [17],  should  perhaps  be  distinguished  and 
called  “Galerkin  variational  methods”  due  to  the  approach  used 
to  construct  the  Lagrangian.  We  shall  not  otherwise  discuss 
those  methods  here.  However,  the  reader  should  be  aware  that 
what  we  shall  present  here  can  easily  be  extended,  at  least  in 
principle,  to  all  variational  methods. 

The  major  criticism  of  any  use  of  the  VA  has  generally 
concentrated  on  the  fact  that  there  was  no  general  method  for 
estimating  the  quantitative  validity  of  the  VA,  except  by  com¬ 
parison  with  the  exact  solution.  With  the  VA,  one  has  obtained 
an  approximate  solution.  But  how  could  one  determine  whether 
or  not  this  approximate  solution  was  of  sufficient  accuracy  to  be 
of  value?  In  the  absence  of  having  the  exact  solution,  one  had  no 
way  to  answer  this.  This  fact  has  always  caused  one  to  hesitate, 
even  if  only  momentarily,  before  using  or  quoting  variational 
results,  despite  its  many  frequent  successes.  This  hesitation  has 
even  been  addressed  in  literature  pertaining  to  variational  ap¬ 
proximations.  “ ...  As  for  the  necessity  to  verify  the  validity  of 
the  results  against  direct  simulation,  this  does  not  devaluate  the 
VA,  since  it  is  frequently  sufficient  to  perform  the  comparison 
at  a  few  different  values  of  the  problem's  control  parameters.  If 
the  comparison  at  several  benchmark  points  corroborates  the 
applicability  of  VA,  then  its  (semi-)analytical  predictions  are  re¬ 
liable  enough  to  describe  solitons  in  broad  parametric  regions  ” 
[9].  What  is  lacking  is  a  general  technique  for  the  quantitative 
evaluation  of  any  VA. 

As  mentioned  above,  in  the  VA  one  expects  to  find  a  solution 
“close”  to  the  exact  solution.  But  how  close  is  “close”?  The 


unstable,  whence  their  result  on  lhe  Massive  Thnring  model  was  quile  surpris¬ 
ing.  Afterwards,  Ref.  [22]  poinied  oul  how  a  varialional  approximalion  could 
indeed  produce  an  unslable  resull  for  a  stable  sysiem  ,  Thus  lhe  blame  for  the 
mistake  seemed  to  be  due  lo  the  varialional  method.  However  later  Ref.  [27] 
verified  that  the  instabilily  resulis  of  Ref  [24]  were  indeed  true,  up  to  the  one 
excepliorr  lhai  being  the  iniegrable  Massive  Thirring  model,  whose  solitons 
were  all  found  lo  be  actually  stable.  Thus  Ref.  [22]  was  indeed  correci  in  lhat  the 
VA  of  Ref.  [24]  did  mispredicl  the  siability  of  the  integrable  Massive  Thirring 
model  and  demonsirated  how  such  could  occur.  On  the  other  hand,  and  not  in 
conflicl  wilh  this.  Ref,  [24]  did  indeed  correclly  predici  the  general  siability 
properties  of  those  gap  solilons  belonging  lo  the  non-iniegrable  models. 


key  to  determining  this  is  to  return  to  the  early  20th  century 
when  variational  methods  were  being  used  with  great  intensity. 
This  was  so  much  the  case  that  even  perturbation  methods  for 
VAs  were  being  developed  [15].  Certainly  it  would  have  been 
generally  known  that  the  size  of  a  term  at  any  one  given  order 
would  have  given  an  estimate  of  the  accuracy  at  that  order.  And 
certainly  this  would  have  been  used  to  determine  to  how  many 
orders  one  needed  to  expand  At  the  same  time,  this  clearly  tells 
us  how  to  quantitatively  estimate  any  VA,  What  has  to  be  done 
is  to  formulate  the  VA  as  the  leading  term  in  a  perturbation 
expansion.  Then  provided  one  is  expanding  about  a  function 
which  is  sufficiently  close  to  the  exact  solution,  any  term  in  the 
perturbation  expansion  will  indicate  the  resulting  error  if  the 
series  is  terminated  at  that  term. 

Note  that  we  are  discussing  functional  accuracy,  not  numer¬ 
ical  accuracy.  This  is  a  step  higher  than  what  was  used  in  the 
Raleigh-Ritz  method,  wherein  the  criteria  was  only  the  accu¬ 
racy  of  the  eigenvalue.  At  the  same  time,  there  is  no  one  single 
number  that  can  represent  “functional  accuracy”.  The  purpose 
of  this  Letter  is  to  present  a  method  for  quantitatively  estimating 
the  functional  error  in  any  VA  by  using  only  linear  techniques 
and  without  having  the  exact  solution  at  hand.  By  utilizing  the 
functional  accuracy,  one  will  be  able  to  determine  the  numerical 
accuracy  in  any  quantity  of  interest. 

The  method  of  the  VA  is  quite  straightforward:  establish 
the  Lagrangian  for  the  governing  system,  choose  a  reasonable 
trial  function  (ansatz)  containing  certain  arbitrary  parameters, 
calculate  the  action  for  that  ansatz  and  then  determine  the  op¬ 
timum  parameters  of  the  ansatz  by  solving  the  Euler-Lagrange 
equations  [25].  Given  an  optimized  ansatz,  one  can  then  quan¬ 
titatively  estimate  the  error  in  the  VA  without  needing  the  exact 
solution  To  do  this,  all  that  is  required  is  for  one  to  be  able  to 
solve  a  linear  problem,  to  be  defined  below. 

To  illustrate  this  method,  two  physical  systems  will  be  ana¬ 
lyzed  These  example  systems  have  been  chosen  to  be  simple 
so  that  the  method  can  be  easily  illustrated  These  examples  arc 
also  only  one-dimensional  for  the  same  reason.  However,  it  is 
clear  that  an  extension  of  this  method  to  multidimensional  prob¬ 
lems  can  be  developed,  although  it  could  possibly  be  fraught 
with  additional  difficulties.  But  here,  the  first  system  that  we 
shall  study  is  the  fundamental  mode  of  a  vibrating  string.  Using 
a  sawtooth  ansatz  and  a  parabolic  ansatz,  we  will  demonstrate 
how  to  quantitatively  determine  how  “close”  the  VA  is  to  the 
correct  solution.  The  second  model  will  be  the  well-known  soli- 
ton  solution  of  the  Korteweg-de  Vries  (KdV)  equation.  Here 
we  will  use  a  Gaussian  ansatz,  and  demonstrate  how  to  quanti¬ 
tatively  estimate  two  types  of  errors  which  this  VA  could  intro¬ 
duce  into  another  calculation.  In  one  case,  the  Gaussian  ansatz 
is  an  acceptable  approximation  while  in  the  other  case,  it  is  un¬ 
acceptable. 

2.  Theory 

Let  us  first  outline  the  theory  behind  this  method  of  quanti¬ 
tatively  estimating  the  error  in  a  VA.  Suppose  the  Lagrangian, 
L,  corresponding  to  the  physical  system  of  interest  is  known 
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and  that  this  Lagrangian  is  a  functional  of  the  variable  u(i).  The 
variable  u(l)  may  be  a  scalar,  vector  or  tensor  quantity.  For  now 
we  shall  only  consider  a  one-dimensional  scalar  case,  where  the 
integration  variable  is  i. 

The  action,  S[u ],  is  defined  by 


5  =  j  Uu]dt ,  (I) 

V 

where  T>  is  the  domain  of  support  of  the  function  u(i).  Hamil¬ 
ton’s  Principle  states  that  the  governing  equations  of  the  vari¬ 
able  u(t)  will  be  such  that 


Slu(l)  + 6*11(0] -S[«(0]  f8L  r  n 

lim - =  /  —  n(f)  \&u(l)at  =  0, 

e— 0  €  J  8u L  J 

V  (2) 

for  any  bounded  variation  6w(/),  provided  that  this  variation 
vanishes  at  all  end  points  of  the  domain  V .  Note  that  this  also 
defines  the  quantity  ^£(f),  which  is  called  the  (first)  variational 
derivative  of  L. 

Now,  let  us  define  the  (nonlinear)  operator  <P  to  be  the  first 
variational  derivative  of  L: 


<*>[•]  = 


8u 


This  gives  us  the  Euler-Lagrange  equation. 


4>lue]  =  0, 


(3) 


(4) 


where  ue  will  be  the  exact  solution(s).  (There  could  be  more 
than  one  solution,  depending  on  the  boundary  conditions  im¬ 
posed  on  ue.)  Thus,  the  problem  is  to  find  ue.  Let  us  now  use 
Hamilton’s  Principle  to  obtain  a  variational  approximation  to 
ue(t).  The  problem  is  then  to  find  the  “best”  solution  for  what¬ 
ever  ansatz  is  chosen  (the  VA)  and  then  to  obtain  a  quantitative 
estimate  of  its  resulting  error(s). 

As  mentioned  earlier,  we  want  to  expand  u€  in  a  perturba¬ 
tion  expansion  about  the  ansatz  plus  some  error.  Let  wo(f;  q)  be 
the  ansatz,  where  q  is  some  finite  collection  of  parameters,  on 
which  uq  is  dependent.  Then  we  should  expand  ue  as 


Hue]dt 


j  L[uo  +  eu\  +€2U2]dt  +  0(e3) 
V 


L[u0]dt  +€ 


(w  i  +€uz)dt 
o 


://(« ^)0'<.<'>»,(<W,'  +  o(f>). 


V  f) 


(6) 


where  the  subscript  “0”  indicates  evaluation  at  u  =  wo •  Now 
consider  the  expression  above.  By  Hamilton’s  Principle,  we  are 
to  find  a  ue  such  that  for  small  variations  about  wf,  we  will 
have  8S  =  0.  In  making  this  variation,  we  can  only  vary  the 
parameters  q  and  the  functions  u\ ,  U2>  etc. 

Consider  the  leading  order  term  in  Eq.  (6).  Since  the  func¬ 
tional  form  of  uo  has  been  fixed,  we  can  only  vary  the  parame¬ 
ters  q.  In  lowest  order  (e  =  0),  variations  of  the  q' s  will  give  the 
set  of  equations 


^  =  L[Uo]dt=0'  (7) 
H  V 

which  determine  the  lowest  order  solution  (and  structure)  of  the 
<7’s.  Once  this  is  done,  we  have  the  <7’s  determined,  which  is  the 
result  of  applying  the  VA. 

Let  us  consider  the  next  order,  the  e  term.  By  varying  the 
function  u\  we  can  impose  additional  linearly  independent  vari¬ 
ations,  beyond  what  was  imposed  in  zeroth  order.  Varying  u\  in 
•the  6  term  will  give  us: 


which  is  nothing  more  than  the  cquation(s)of  motion  evaluated 
at  the  ansatz  (u  =  uo)  However,  unless  uo  happens  to  be  an 
exact  solution,  it  is  impossible  for  this  condition  to  be  satisfied. 
But  it  can  be  almost  satisfied  if  uo  is  close  to  an  exact  solution, 
in  which  case  the  left-hand  side  of  (8)  will  actually  be  “small” 
but  nonzero.  And  since  it  is  nonzero,  it  must  be  balanced  by 
something.  The  only  possible  terms  that  could  balance  it  are 
the  next  higher  order  terms,  which  are  the  €2  terms  in  (6).  This 
we  can  naturally  do  if  we  take  the  size  of  6  to  be  determined  by 
the  order  of  magnitude  of  (f~)o-  Since  (8)  cannot  generally  be 
satisfied,  wc  then  must  take 


ue  =  uo(t\q)  +  €u\(t)  +  e2U2(t)-\ - .  (5) 

where  e  is  an  expansion  parameter  whose  size  will  be  defined 
shortly.  The  collection  of  parameters,  q,  on  which  the  trial  func¬ 
tion  is  dependent,  is  assumed  to  be  finite  and  could  include  an 
amplitude,  width,  position,  velocity,  chirp,  phase,  etc.,  depend¬ 
ing  on  the  problem.  The  variation  of  wo  will  then  consist  only  of 
the  changes  that  occur  in  wo  when  these  parameters  are  varied. 
In  varying  these  parameters,  from  (2),  we  will  obtain  a  finite 
number  of  independent  variations  of  S. 

Let  us  observe  what  happens  when  we  use  this  expansion. 
Substituting  the  expansion  given  by  (5)  into  (1),  and  then  ex¬ 
panding  about  w  =  uo,  yields 


«9' 

which  defines  the  functional  R[q]  and  at  the  same  time  also 
defines  €.  Inserting  (9)  into  (6)  then  gives  us  an  expansion  in 
which  no  €  terms  are  present.  Rather,  the  next  lowest  order 
terms  are  the  f2  terms  and  upon  varying  u\  in  these  e2  terms, 
one  obtains 

V 

Note  that  this  equation  is  linear  in  u  \ ,  and  thus  to  determine  w  i , 
one  only  has  to  solve  a  linear  nonhomogeneous  problem. 
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In  hindsight,  this  result  is  not  surprising.  Although  we  have 
used  Hamilton’s  Principle  to  derive  this  result,  one  just  as  easily 
could  have  derived  it  from  the  Euler-Lagrange  equations.  From 
(3H5),  upon  using  (9),  one  can  easily  obtain  (10).  One  can  also 
interpret  what  is  occurring  here,  and  give  a  simple  statement  of 
its  procedure.  One  starts  with  an  ansatz,  wo»  and  uses  the  VA 
to  calculate  the  best  values  for  its  parameters.  Then  we  calcu¬ 
late  the  first-order  correction  to  the  ansatz,  u j,  based  on  the 
linearly  perturbed  Euler-Lagrange  equations,  (10).  The  small¬ 
ness  of  various  features  of  this  correction  will  be  a  basis  for  a 
quantitative  measure  of  the  validity  of  the  VA. 


For  the  system  described  in  (11),  we  may  take 


L(it'Ux)  =  ^ 


du\ 
dx ) 


(12) 


where  v  =  k 2  is  the  eigenvalue.  Note  that  due  to  translational 
invariance,  we  only  need  to  treat  the  case  of  the  fundamental 
mode,  since  the  nth-order  mode  for  a  string  of  length  £,  can  be 
obtained  from  the  fundamental  mode  of  a  string  of  length  £//t. 


3  1.  The  variational  approximation 


3.  The  vibrating  string 

The  vibrating  string  of  a  finite  length  £,  fixed  at  both  ends, 
has  a  simplicity  which  allows  one  to  readily  understand  the  ap¬ 
proach  used.  If  we  decompose  its  evolution  into  normal  modes, 
then  we  may  use  a  VA  to  determine  the  modal  shapes  and  the 
modal  eigenfrequencies.  The  modal  problem  is  a  linear  eigen¬ 
value  problem,  with  a  well  known  solution.  We  take  the  Euler- 
Lagrange  equation  to  be 


*[«(*)]  =  0.  <*>!•]  =  ^y[»]  +  *2M.  (11) 

where  <f>[»]  is  the  (linear)  operator,  k 2  is  the  eigenvalue,  the  in- 
terval  is  [0  ^  x  ^  £]  and  the  integration  variable  is  now  changed 
to  x.  Our  boundary  conditions  are  u( 0)  =  0  =  w(£). 

Since  this  problem  is  linear,  the  amplitude  of  the  solution 
is  arbitrary.  To  eliminate  this  irrelevant  degree  of  freedom,  we 
shall  require  the  norm  of  the  eigenmode,  m(jc),  to  be  unity. 


We  shall  consider  two  different  approximations  for  the  fun¬ 
damental  eigenfunction  of  (11):  a  sawtooth  approximation  and 
an  upside-down  parabola,  each  of  which  is  given  by 


(sawtooth) 

W0 


(x\AJ) 


^x. 

Y-(e-x). 


0  <  jc  < 

|  ^  jc  <  £, 


(13) 

(14) 


where  A  is  the  amplitude  of  the  eigenfunction.  Requiring  the 
norm  of  each  ansatz  to  be  unity,  gives 


^(sawtooth)  _ 


^(parabolic)  __ 


(15) 


Diagrams  of  both  ansatze  are  show  n  in  Fig  1. 

As  outlined  in  Section  2,  once  we  have  the  ansatz,  we  then 
evaluate  the  action  for  that  ansatz.  However  for  eigenvalue 
problems,  it  is  standard  to  set  up  the  variational  prohlem  with 
the  “action”  being  the  eigenvalue  [  1 4, 1 5].  Taking  this  approach. 


Sawtooth  Ansatz 


0.0 


0.2 


0.4  n  0.6 


0.8 


1.0 


Sawtooth  Ansatz  Error 


Parabolic  Ansatz  Error 


Fig.  1.  The  above  figures  depict  two  variational  approximations  for  the  vibrating  string  vs.  p  -  x ft.  On  the  left  is  the  variational  ansatz,  its  first-order  correction, 
and  the  sum.  On  the  right  we  have  plotted  the  difference  between  the  exact  solution  and  the  sum  of  variational  ansatz  and  the  first  order  correction. 
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we  then  vary  the  eigenvalue,  v,  where 


v  = 


Jq  u2dx 


(16) 


It  is  trivial  to  verify  that  varying  u  and  requiring  the  action 
to  be  an  extremum  then  gives  us  (11).  Inserting  (13)  and  (14) 
into  (16)  respectively,  we  obtain 


i4“th)  =  -^=1.216...7r2/£2,  (17) 

v(p«abolic)  _  12  _  j  0,3  n2/e2.  (18) 


In  this  case,  the  only  parameter  was  the  amplitude,  which  was 
set  by  the  normalization  condition  Furthermore,  since  (11)  is 
linear,  the  amplitude  drops  out  anyway  and  the  action  has  only 
a  fixed  value  and  no  further  variation  can  be  done.  Thus,  the  VA 
for  the  eigenvalue  for  each  case  takes  on  the  value  shown  above. 
As  for  accuracy,  we  see  that  the  sawtooth  ansatz  is  less  accu¬ 
rate  with  the  eigenvalue  within  22%  of  the  exact  value.  On  the 
other  hand,  the  parabolic  approximation  is  much  better  giving 
an  eigenvalue  that  is  within  1 .3%  of  the  correct  value.  However, 
we  know  this  only  because  we  are  comparing  with  the  exact  so¬ 
lution.  Let  us  now  see  how  we  can  quantify  the  level  of  accuracy 
without  knowing  the  correct  answer. 


3  2.  The  correction 


In  solving  for  the  first-order  correction  to  an  eigenvalue 
problem,  there  will  also  be  an  eigenvalue  shift  as  well  as  a  shift 
in  the  function.  Rather  than  rederiving  our  result  for  eigenvalue 
problems,  we  may  simply  recognize  that  this  is  the  case  and 
that  a  shift  is  needed.  Then  directly  using  (9)  and  (10),  we  ob¬ 
tain: 


f  ( T~TTTT~nl  “l  (x)dx  +  R(x)  4- &vu0(x)  =  0,  (19) 

J  \Su(x)Su(x')/ o 

V 

where  we  have  taken  <5 v  to  be  the  shift  in  the  eigenvalue  and 
defined 


d^UQ 

R(x)  =  —j-  + 

Since  uo  =  k2,  then  in  general 
v  =  k2  4*  8v, 

in  which  case,  from  (12),  (19)  becomes 

d2u\  ? 

4-  k^u\  —  U(x ), 

where 

U(x)  =  —  R(x)  —  8vuq(x). 


(20) 


(21) 


(22) 


(23) 


Eq.  ( 1 9)  is  a  linear  problem,  even  when  the  analogy  of  ( 1 1 )  is 
nonlinear.  Thus,  we  can  always  solve  it  with  linear  techniques, 
and  in  this  case,  the  solution  of  (22)  is 


u  I  =  -iC+eikx -iC-e~ikx 


/ 1,,,.,  *-<»<* -''ly. 


(24) 


0 

where  C±  are  the  two  constants  of  integration.  As  one  may  ver¬ 
ify  by  integrating  by  parts. 


A 

I 


U(x) 


,  sin(A:(jf  —  x')) 


dx 


du0  sin(kx) 

=  — - - u0(x) 

dx  k 


A 

—  J  uo(x')sin(k(x  •±xf))dx. 


(25) 


Now  we  impose  our  boundary  conditions  and  the  normaliza¬ 
tion  condition.  At  the  left  end,  w(0)  =  0  gives  C+  =  —  C_.  At 
the  right  end,  u(t)  =  0  gives 

C+  =  -C- 


1  duo 

=  -2kl7(0) 


a 

+  2 i^U)I  U0lX,)^{k{t-X'))dX'-  (26) 

0 


Imposing  the  condition  that  the  norm  be  unity  to  order  e  will 
give 


t 

J  uo(xf)u\{x,)dxt  =  0.  (27) 

o 

which  along  with  (26)  will  give  us  the  value  for  <$u. 

Let  us  now  consider  our  two  ansatze.  With  the  above,  it  is 
straightforward  to  evaluate  the  integrals  analytically  as  well  as 
numerically.  We  will  not  give  the  analytical  expressions  due  to 
their  complexity.  The  numerical  values  are 

C£awioo*)_00873  A 

(sawtooth)  _  _0  2,9  .jpffi,  (28) 


^pantbolic)  =  _0 ,19  A 

,5 (Parabolic)  _  _()  0|  3.  .  .  n2 / P  (29) 

whereby,  we  note  that  the  corrections  to  the  eigenvalues  are  in¬ 
deed  essentially  what  we  had  estimated  from  our  knowledge  of 
the  exact  solution.  Thus  8v  does  indeed  give  a  quantitative  esti¬ 
mate  of  the  accuracy  of  these  two  VAs.  As  we  had  pointed  out 
earlier,  the  parabolic  ansatz  is  more  accurate  than  the  sawtooth 
and  the  above  numbers  give  a  quantitative  justification  of  that 
statement. 

Naturally,  the  corrected  eigenvalues  will  be  more  accurate, 
and  these  values  are 


(v0  +  5u)<sawtoo,h)  =  0.99675. . .  jt2/(2, 
(u0  +  5v)(Par,bolSc»  =  0.99998. . .  iPjt 2. 


(30) 
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which  gives  an  almost  five  9s  agreement  for  the  corrected  par¬ 
abolic  approximation. 

The  correction  to  an  eigenvalue  is  just  one  possible  quanti¬ 
tative  estimate  of  VA  accuracy.  Another  quantitative  estimate 
is  the  RMS  error  of  the  ansatz.  Taking  the  above  expressions 
for  (24)  and  defining  the  relative  RMS  error,  Em**  by 


Fi  _  fo(m)2dx 

m  /0W^' 

we  find  that  for  each  of  our  approximations. 


(31) 


£masW,0Oth)  =  o.  1 24. . . ,  =  0.038. . . .  (32) 

Obviously,  again  the  parabolic  ansatz  is  the  most  accurate. 


4.  The  Korteweg-de  Vries  equation 


This  equation  is  a  well-known  nonlinear  equation  [26]  and 
we  will  use  it  to  demonstrate  how  one  can  obtain  estimates  of 
the  error  of  a  VA  in  a  nonlinear  system.  We  shall  apply  this 
method  to  finding  soliton  (localized  traveling  wave)  solutions 
of  the  Korteweg-de  Vries  (KdV)  equation,  which  is 

u,  +  6uux  +  uxxx  =  0.  (33) 

Soliton  solutions  of  the  form  w(jc,  /)  =  U (£ )  where  £  =  x  —  ct, 
exist  where  c  is  the  wavespeed.  The  amplitude  of  such  a  wave, 
U ,  then  satisfies  the  ODE 

-cUs  +6  U$U  +  =  0.  (34) 

Since  these  soliton  solutions  vanish  as  £  — ►  ±oc,  we  may  inte¬ 
grate  this  equation  once  to  obtain  the  equation 

—cU  4-  3U2  +  =  0.  (35) 

From  (35)  one  may  construct  the  Lagrangian 

L  =  —^cU2  +  £/3  —  ^(U()2.  (36) 

Let  us  take  a  Gaussian  ansatz  of  the  form 

Uq($)  =  A  exp (— f  2/p2),  (37) 

where  Uo  has  two  parameters:  an  amplitude  A  and  a  width  p. 
Given  this  ansatz,  we  may  construct  the  action,  5: 

+  OC 

S=  J  i.[n0(*)]<tf 

-oo 

\^2n  /  7  4  A2\ 

=  -—{cA2P^A2P  +  -).  (38) 

Note  that  in  this  case,  c  is  a  control  parameter  and  is  not  to 
be  varied.  Varying  the  action  with  respect  to  the  trial  function 
parameters  will  result  in  two  Euler-Lagrange  equations  whose 
solution(s)  will  give  “the  closest  fit”  to  the  exact  solution.  These 
equations  reduce  to 

c  c 

—  =  0  =  cp2  —  V6Ap2  +  1 ,  (39) 

oA 


8S  ,  4  ,  , 

Sp  v6 

whose  solutions  are 


(40) 


(41) 


4. 1.  First-order  correction 

We  shall  now  calculate  the  first-order  correction  to  the  vari¬ 
ational  ansatz.  As  before,  we  take  an  expansion  as  in  (5), 

Ue  =  Uo(S:q)  +  €Ui$)^^  (42) 

and  insert  it  into  (35).  In  the  above,  e  is  a  small  ordering  pa¬ 
rameter,  Uq(£)  is  the  variational  ansatz  and  U i(f)  is  the  first- 
order  correction.  Again  we  note  that  if  (37)  together  with  (41) 
is  a  reasonably  accurate  approximation,  then  wc  would  expect 
the  quantity 

eR^)  =  -cU0  +  3U2  +  U0.((.  (43) 

to  be  quite  small.  This  then  should  generate  a  reasonably  small 
U |,  which  would  be  determined  by  the  linear  problem 

-cUx  +  (>UoU\+UxM  =-/?«).  (44) 

The  problem  of  solving  (44)  can  be  simplified  if  we  scale 
our  coordinates  and  amplitudes  to  be  unitless.  Let  us  define  a 
new  coordinate,  y,  and  new  amplitudes,  Hq  and  H\%  where 

f  =py<  HQ(y)=cU0^h  Hi(y)=cU\(!-).  (45) 

in  which  case  (43)  and  (44)  become 

d2H\  .  r  v2 

+  (6v/fc--'  — 5)W,  =-R'(y).  (46) 

where 

R'  =  ^“y2[(4>'2  -  7)  +3>/6e-yJ].  (47) 

The  boundary  conditions  on  H\  are  such  that  H\  -*•  0  as  y  — ► 
dhoc  There  will  be  two  homogeneous  solutions  of  (47)  and  one 
particular  solution.  The  two  homogeneous  solutions  approach  a 
linear  combination  of  exp(± V5y)  as  y  — ►  ±oo  The  boundary 
conditions  will  give  a  unique  solution  for  H\ ,  provided  that  nei¬ 
ther  homogeneous  solution  is  a  bound  state  (which  is  the  case). 

Since  this  ODE  does  not  seem  to  have  an  analytical  solution, 
it  has  been  solved  numerically.  Using  the  shooting  method,  we 
integrate  (46)  from  a  large  negative  value.  We  have  to  integrate 
two  homogeneous  solutions  and  one  particular  solution  and  find 
the  appropriate  linear  combination  at  the  right  end  such  that  the 
total  solution  vanishes  as  one  approaches  positive  infinity.  The 
result  is  the  unique,  bounded  solution  shown  in  Fig.  2. 

In  Fig.  3,  we  show  the  ansatz  Hq  compared  to  the  sum 
Hq  4-  H\.  As  one  can  see,  the  first-order  correction  mainly 
corrects  for  the  more  rapid  decay  of  the  Gaussian  at  large  y. 
In  Fig.  4,  we  show  the  difference  between  the  exact  solution, 
He,  and  Hq  +  H |,  which  will  indicate  what  the  second-order 
correction  would  be.  Here  we  note  that  the  largest  remaining 
correction  is  in  the  central  region  of  the  pulse.  [Since  the  KdV 
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Fig.  2.  The  first-order  correction,  H[(y).  lo  ihe  Gaussian  ansatz. 


Hq  and  H)  +  Hi  vs.  y 


Fig.  3.  Hq{\)  and  H0(y)  +  //t(.v). 


He-Ho-H!  vs.  y 


Fig.  4  The  exact  solution,  //*.(>),  minus  the  sum  of  the  ansatz  and  the  first-order 
correction,  Hq(v)  4-  H\  (y). 


Fig.  5.  A  log(H)  plot  showing  the  amplitudes  of  the  tails  vs.  the  width  at 
half-height,  with  Hq  being  the  lower  curve  and  with  Hq(x)  +  H\(y)  being 
the  upper  curve  The  dashed  line  is  an  example — see  the  text. 

As  already  mentioned,  there  are  several  ways  in  which  one 
could  quantitatively  measure  the  accuracy  of  an  ansatz,  de¬ 
pending  on  the  quantity  in  which  one  was  interested.  We  shall 
consider  two  possibilities.  First,  the  standard  rms  error  (31) 
can  again  be  calculated.  This  would  be  appropriate  as  a  mea¬ 
sure  of  the  global  error  in  the  pulse  shape.  Here  we  find  that 
^Vms  *=  0  .0380. . . ,  which  indicates  that  this  is  quite  good  for  a 
global  approximation.  However,  if  one  were  interested  in  cal¬ 
culating  soli  ton-soli  ton  interactions  wherein  the  overlap  of  the 
tails  were  important,  then  the  situation  would  be  different.  So 
the  quantitative  measure  of  a  VA  will  depend  on  the  application 
to  which  it  will  be  applied. 

Let  us  now  be  definite  and  calculate  a  quantitative  measure 
for  this  case.  For  the  Gaussian  approximation,  we  have  from 

Fig.  3  that  the  width  at  half-height  is  w\/ 2  =  1  594 _ If  one 

were  to  plot  the  tail  amplitude  versus  this  width,  as  in  Fig.  5, 
one  would  readily  see  that  the  Gaussian  ansatz  is  an  inferior 
approximation  for  the  tails.  For  example,  if  one  chose  to  have 
the  solitons  separated  such  that  the  tail  amplitude  in  the  region 
of  the  next  soliton  would  be  no  more  than  0.001,  the  dashed 
line  in  Fig.  5  shows  that  a  soliton-soliton  separation  of  only 
1.56  widths  would  be  sufficient.  However  if  we  add  the  first- 
order  correction  to  the  ansatz.  Ho  +  H\%  which  is  also  shown 
in  Fig.  5,  then  we  would  come  to  a  different  conclusion.  The 
curve  Ho  T  H\  shows  that  the  ansatz  plus  correction,  at  a  width 
of  1.56,  would  have  an  amplitude  of  ^  0.008,  which  is  eight 
times  too  large.  Looking  at  where  the  separation  should  actu¬ 
ally  be,  we  see  that  the  tails  of  Hq  4*  H\  do  not  decay  to  0.001 
until  at  least  2.1  widths.  Thus  since  the  ratio  of  these  widths 
is  2.1/1.59  =  1.32,  a  measure  of  the  VA  accuracy  in  this  case 
would  be  ±32%,  considerably  more  than  the  3.8%  indicated  by 
the  rms  error. 

5.  Conclusion 


equation  is  nonlinear,  after  the  first-order  correction  we  would 
have  a  nonzero  second-order  correction,  which  would  therefore 
be  largest  wherever  the  amplitude  was  largest.  Here  we  see  that 
this  is  indeed  the  case.] 


In  this  Letter  we  have  established  a  method  for  quantify¬ 
ing  the  accuracy  of  variational  approximations  which  does  not 
require  an  exact  solution  of  the  governing  equation(s)  of  mo- 
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tion.  The  methcxi  possesses  an  inherent  simplicity  in  that  the 
measure  of  accuracy  is  obtained  via  solving  an  inhomogeneous 
linear  equation. 

We  have  also  pointed  out  that  the  quantitative  measure  of  any 
variational  approximation  must  be  based  on  the  intended  use  of 
the  solution.  Depending  on  the  use,  the  VA  could  either  be  quite 
acceptable  or  could  even  be  unacceptable  The  smallness  of  the 
correction,  or  the  smallness  of  its  deviation  from  some  impor¬ 
tant  property  of  the  exact  solution,  should  be  the  quantitative 
measure  of  the  validity  of  any  VA. 

This  work  opens  up  additional  approaches  to  the  utilization 
of  variational  approximations  as  a  modeling  technique  for  phys¬ 
ical  and  mathematical  systems.  There  is  no  longer  the  necessity 
for  having  exact  (or  close  to  exact)  solutions  a  priori  in  order 
to  determine  the  validity  of  a  variational  approximation.  The 
advantage  of  this  for  nonlinear  problems  is  that  to  obtain  an  esti¬ 
mate  of  the  correction,  one  only  needs  to  solve  a  linear  problem, 
not  the  fully  nonlinear  problem.  The  steps  in  this  method  for 
quantifying  the  quality  of  a  VA  consists  of  choosing  an  analyti¬ 
cal  model  for  the  ansatz  which  will  be  close  to  that  of  the  exact 
solution,  obtain  a  VA  from  that  ansatz  and  use  the  technique 
outlined  here  to  determine  the  accuracy  of  those  results.  This 
latter  is  accomplished  by  estimating  the  first-order  correction 
to  the  VA.  This  first-order  correction  may  then  be  used  to  also 
improve  upon  the  predictions  of  the  VA.  Naturally,  one  could 
develop  an  iterative  process  from  this  for  improving  the  accu¬ 
racy  of  a  variational  technique  to  higher  orders. 

Note  added  in  proof 

It  needs  to  be  mentioned  that  an  earlier  method  to  estimate 
the  validity  of  a  variational  approximation  had  been  given  in 
Ref,  [28],  where  Dexais,  Anderson  and  Lasik  investigated  how 
well  a  variational  solution  would  preserve  the  next  higher  order 
invariant  beyond  the  Hamiltonian. 
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